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Preface 


The most extensive changes in this edition occur in the segment of the book devoted to 
commutative algebra, especially in Chapter 7, Commutative Ideal Theory: General Theory and 
Noetherian Rings; Chapter 8, Field Theory; and Chapter 9, Valuation Theory. In Chapter 7 we 
give an improved account of integral dependence, highlighting relations between a ring and its 
integral extensions (“lying over,” “going-up,” and “going-down” theorems). Section 7.7, 
Integrally Closed Domains, is new, as are three sections in Chapter 8: 8.13, Transcendency 
Bases for Domains; 8.18, Tensor Products of Fields; and 8.19, Free Composites of Fields. The 
latter two are taken from Volume III of our Lectures in Abstract Algebra (D. Van Nostrand 
1964; Springer-Verlag, 1980). The most notable addition to Chapter 9 is Krasner’s lemma, 
used to give an improved proof of a classical theorem of Kurschak’s lemma (1913). We also 
give an improved proof of the theorem on extensions of absolute values to a finite dimensional 
extension of a field (Theorem 9.13) based on the concept of composite of a field considered in 
the new section 8.18. 

In Chapter 4, Basic Structure Theory of Rings, we give improved accounts of the 
characterization of finite dimensional splitting fields of central simple algebras and of the fact 
that the Brauer classes of central simple algebras over a given field constitute a set—a fact 
which is needed to define the Brauer group Br(F). In the chapter on homological algebra 
(Chapter 6), we give an improved proof of the existence of a projective resolution of a short 
exact sequence of modules. 

A number of new exercises have been added and some defective ones have been deleted. 

Some of the changes we have made were inspired by suggestions made by our colleagues, 
Walter Feit, George Seligman, and Tsuneo Tamagawa. They, as well as Ronnie Lee, Sidney 
Porter (a former graduate student), and the Chinese translators of this book, Professors Cao Xi- 
hua and Wang Jian-pan, pointed out some errors in the first edition which are now corrected. 
We are indeed grateful for their interest and their important inputs to the new edition. Our 
heartfelt thanks are due also to F. D. Jacobson, for reading the proofs of this text and especially 
for updating the index. 


January 1989 Nathan Jacobson 


Preface to the First Edition 


This volume is a text for a second course in algebra that presupposes an introductory course 
covering the type of material contained in the Introduction and the first three or four chapters of 
Basic Algebra I. These chapters dealt with the rudiments of set theory, group theory, rings, 
modules, especially modules over a principal ideal domain, and Galois theory focused on the 
classical problems of solvability of equations by radicals and constructions with straight-edge 
and compass. 

Basic Algebra Il contains a good deal more material than can be covered in a year’s 
course. Selection of chapters as well as setting limits within chapters will be essential in 
designing a realistic program for a year. We briefly indicate several alternatives for such a 
program: Chapter | with the addition of section 2.9 as a supplement to section 1.5, Chapters 3 
and 4, Chapter 6 to section 6.11, Chapter 7 to section 7.13, sections 8.1—8.3, 8.6, 8.12, Chapter 
9 to section 9.13. A slight modification of this program would be to trade off sections 4.6—4.8 
for sections 5.1—5.5 and 5.9. For students who have had no Galois theory it will be desirable 
to supplement section 8.3 with some of the material of Chapter 4 of Basic Algebra I. If an 
important objective of a course in algebra is an understanding of the foundations of algebraic 
structures and the relation between algebra and mathematical logic, then all of Chapter 2 
should be included in the course. This, of course, will necessitate thinning down other parts, 
e.g., homological algebra. There are many other possibilities for a one-year course based on 
this text. 

The material in each chapter is treated to a depth that permits the use of the text also for 
specialized courses. For example, Chapters 3, 4, and 5 could constitute a one-semester course 
on representation theory of finite groups, and Chapter 7 and parts of Chapters 8, 9, and 10 
could be used for a one-semester course in commutative algebras. Chapters 1, 3, and 6 could 
be used for an introductory course in homological algebra. 

Chapter 11 on real fields is somewhat isolated from the remainder of the book. However, it 
constitutes a direct extension of Chapter 5 of Basic Algebra I and includes a solution of 
Hilbert’s problem on positive semi-definite rational functions, based on a theorem of Tarski’s 
that was proved in Chapter 5 of the first volume. Chapter 11 also includes Pfister’s beautiful 
theory of quadratic forms that gives an answer to the question of the minimum number of 
squares required to express a sum of squares of rational functions of n real variables (see 
section 11.5). 

Aside from its use as a text for a course, the book is designed for independent reading by 


students possessing the background indicated. A great deal of material is included. However, 
we believe that nearly all of this is of interest to mathematicians of diverse orientations and not 
just to specialists in algebra. We have kept in mind a general audience also in seeking to 
reduce to a minimum the technical terminology and in avoiding the creation of an overly 
elaborate machinery before presenting the interesting results. Occasionally we have had to pay 
a price for this in proofs that may appear a bit heavy to the specialist. 

Many exercises have been included in the text. Some of these state interesting additional 
results, accompanied with sketches of proofs. Relegation of these to the exercises was 
motivated simply by the desire to reduce the size of the text somewhat. The reader would be 
well advised to work a substantial number of the exercises. 

An extensive bibliography seemed inappropriate in a text of this type. In its place we have 
listed at the end of each chapter one or two specialized texts in which the reader can find 
extensive bibliographies on the subject of the chapter. Occasionally, we have included in our 
short list of references one or two papers of historical importance. None of this has been done 
in a systematic or comprehensive manner. 

Again it is a pleasure for me to acknowledge the assistance of many friends in suggesting 
improvements of earlier versions of this text. I should mention first the students whose 
perceptions detected flaws in the exposition and sometimes suggested better proofs that they 
had seen elsewhere. Some of the students who have contributed in this way are Monica 
Barattieri, Ying Cheng, Daniel Corro, William Ellis, Craig Huneke, and Kenneth McKenna. 
Valuable suggestions have been communicated to me by Professors Kevin McCrimmon, James 
D. Reid, Robert L. Wilson, and Daniel Zelinsky. I have received such suggestions also from my 
colleagues Professors Walter Feit, George Seligman, and Tsuneo Tamagawa. The arduous task 
of proofreading was largely taken over by Ying Cheng, Professor Florence Jacobson, and 
James Reid. Florence Jacobson assisted in compiling the index. Finally we should mention the 
fine job of typing that was done by Joyce Harry and Donna Belli. I am greatly indebted to all of 
these individuals, and I take this opportunity to offer them my sincere thanks. 


January 1980 Nathan Jacobson 


Introduction 


In the Introduction to Basic Algebra I (abbreviated throughout as “BAT’) we gave an account 
of the set theoretic concepts that were needed for that volume. These included the power set # 
(S) of a set S, the Cartesian product S, x S, of two sets S| and S), maps ( = functions), and 


equivalence relations. In the first volume we generally gave preference to constructive 
arguments and avoided transfinite methods altogether. 

The results that are presented in this volume require more powerful tools, particularly for 
the proofs of certain existence theorems. Many of these proofs will be based on a result, called 
Zorn’s lemma, whose usefulness for proving such existence theorems was first noted by Max 
Zorn. We shall require also some results on the arithmetic of cardinal numbers. All of this fits 
into the framework of the Zermelo—Fraenkel axiomatization of set theory, including the axiom 
of choice (the so-called ZFC set theory). Two excellent texts that can be used to fill in the 
details omitted in our discussion are P. R. Halmos’ Naive Set Theory and the more substantial 
Set Theory and the Continuum Hypothesis by P. J. Cohen. 

Classical mathematics deals almost exclusively with structures based on sets. On the other 
hand, category theory—which will be introduced in Chapter 1—deals with collections of sets, 
such as all groups, that need to be treated differently from sets. Such collections are called 
classes. A brief indication of a suitable foundation for category theory is given in the last 
section of this Introduction. 


).1 ZORN’S LEMMA 


We shall now formulate a maximum principle of set theory called Zorn’s lemma. We state this 
first for subsets of a given set. We recall that a set C of subsets of a set S (that is, a subset of 
the power set A(S)) is called a chain if C is totally ordered by inclusion, that is, for any A, B € 
C either A c B or BCA. A set T of subsets of S is called inductive if the union Ua, of any 
chain C= {A,} C Tis a member of T. We can now state 


ZORN’S LEMMA (First formulation). Let T be a non-vacuous set of subsets of a set S. 
Assume T is inductive. Then T contains a maximal element, that is, there exists anM e T 
such that no A € T properly contains M. 


There is another formulation of Zorn’s lemma in terms of partially ordered sets (BAI, p. 
456). Let P, > be a partially ordered set. We call P, > (totally or linearly) ordered if for every 
a,b e P either a È> b or b > a. We call P inductive if every non-vacuous subset C of P that is 
(totally) ordered by > as defined in P has a least upper bound in P, that is, there exists a u € P 
such that u > a for every a € C and ifv > a for every a € C then v > u. Then we have 


ZORN’S LEMMA (Second formulation). Let P, > be a partially ordered set that is 
inductive. Then P contains maximal elements, that is, there exists m € P such that noa e P 
satisfies m <a. 


It is easily seen that the two formulations of Zorn’s lemma are equivalent, so there is no 
harm in referring to either as “Zorn’s lemma.” It can be shown that Zorn’s lemma is equivalent 
to the 


AXIOM OF CHOICE. Let S be a set, AS)* the set of non-vacuous subsets of S. Then there 
exists a map f (a “choice function”) of Z(S)* into S such that f(A) € A for every A € I(S)*. 


This is equivalent also to the following: If {4,} is a set of non-vacuous sets 4,, then the 
Cartesian product [14.4 Ø. 

The statement that the axiom of choice implies Zorn’s lemma can be proved by an argument 
that was used by E. Zermelo to prove that every set can be well ordered (see Halmos, pp. 62— 
65). A set S is well ordered by an order relation > if every non-vacuous subset of S has a least 
element. The well-ordering theorem is also equivalent to Zorn’s lemma and to the axiom of 
choice. We shall illustrate the use of Zorn’s lemma in the next section. 


).2 ARITHMETIC OF CARDINAL NUMBERS 


Following Halmos, we shall first state the main results on cardinal arithmetic without defining 
cardinal numbers. We say that the sets A and B have the same cardinality and indicate this by 
|A| = |B| if there exists a bijective map of A onto B. We write |A| < |B| if there is an injective map 
of A into B and |A| < |B| if |A| < |B| and |A| 4 |B|. Using these notations, the Schroder-Bernstein 
theorem (BAI, p. 25) can be stated as: |A] < |B| and |B| < |A| if and only if |A| = |B|. A set F is 
finite if |F| = |N| for some N = {0, 1, ..., n— 1} and A is countably infinite if |A| = |o| for œ = 
{0, 1, 2, ...}. It follows from the axiom of choice that if A is infinite ( = not finite), then || < 
|A|. We also have Cantor’s theorem that for any A, |A| < |#(A)|. 

We write C= Aw B for sets A, B, Cif C= AU B and AN B= ¢. It is clear that if |A,| < |A)| 


and |B,| < |B], then |4; ù B,|< |4, ù B3]. Let C= Fù œ where F is finite, say, F = {xọ, ..., x 


Sa p 
1} where x; # x; for i + j. Then the map of C into œ such that x; ~ i, k ~ k +n fork € @ is 
bijective. Hence |C| = |e]. It follows from |@| < |A| for any infinite A that if C= F ù A, then |C| = 
|A|. For we can write A = D ù B where |D| = ||. Then we have a bijective map of F U D onto 
D and we can extend this by the identity on B to obtain a bijective map of C onto A. 


We can use the preceding result and Zorn’s lemma to prove the main result on “addition of 
cardinals,” which can be stated as: If A is infinite and C= A U B where |B| = |M], then |C| = A]. 
This is clear if A is countable from the decomposition œ = {0, 2, 4, ...} © {1, 3,5, ...}. It is 
clear also that the result is equivalent to |A x 2| = |A| if 2 = {0, 1}, since |Æ x 2| = |A ù B|. We 
proceed to prove that |A x 2| = |A| for infinite 4. Consider the set of pairs (X, f) where X is an 
infinite subset of A and f is a bijective map of X x 2 onto X. This set is not vacuous, since A 
contains countably infinite subsets X and for such an X we have bijective maps of X x 2 onto X. 
We order the set {(X, f)} by (X, ) < (Xf) if X c X' and f' is an extension of f. It is clear that 
{(X, f)}, < is an inductive partially ordered set. Hence, by Zorn’s lemma, we have a maximal 
(Y, g) in {(X, f}. We claim that A — Y is finite. For, if A — Y is infinite, then this contains a 
countably infinite subset D, and g can be extended to a bijective map of (Y ù D) x 2 onto Yù D 
contrary to the maximality of (Y, g). Thus 4 — Yis finite. Then 


|A x 2| = (Y x 2)U(A— Y)) x 2| = |Y x 2 


= |Y|=|Al. 
We can extend the last result slightly to 
(1) |A U B| = |B| 


if B is infinite and |B| > |A|. This follows from 
IB| <|A U B| <|B x 2| = |B]. 


The reader is undoubtedly familiar with the fact that |A x A| = |A| if A is countably infinite, 
which is obtained by enumerating œ x œ as 


(0,0), (0, 1), (1,0), (0,2), (1, 1), (2,0),... . 


More generally we have |A x A| = |A| for any infinite 4. The proof is similar to the one for 
addition. We consider the set of pairs (X, f) where X is an infinite subset of A and fis a 
bijective map of X x X onto X and we order the set {(X, f)} as before. By Zorn’s lemma, we 
have a maximal (Y, g) in this set. The result we wish to prove will follow if |Y = |A|. Hence 
suppose |Y] < |A|. Then the relation A = Yù (A -— Y) and the result on addition imply that |A| = |A — 
Y|. Hence |4 — Y| > |Y]. Then A — Y contains a subset Z such that |Z] = |Y. Let W = YU Z, so W= Y 
ù Zand W x Wis the disjoint union of the sets Y x Y, Y x Z, Z x Y, and Z x Z. We have 


(Y x Z)U(Z x Y)u(Z x Z| = |Z x Z| = |Z|. 


Hence we can extend g to a bijective map of W x W onto W. This contradicts the maximality of 
(Y, 2). Hence |¥| = |A| and the proof is complete. 
We also have the stronger result that if A 4 ¢ and B is infinite and |B| > |A|, then 


(2) |A x B| = |B]. 


This follows from 
|B) < |A x B| < |B x B| = |B]. 


0.3 ORDINAL AND CARDINAL NUMBERS 


In axiomatic set theory no distinction is made between sets and elements. One writes A € B for 
“the set A is a member of the set B.” This must be distinguished from A c B, which reads “4 is 
a subset of B.” (In the texts on set theory one finds A c B for our A c B and A C B for our A F 
B.) One defines A c B to mean that Ce A => Ce B. One of the axioms of set theory is that 
given an arbitrary set C of sets, there is a set that is the union of the sets belonging to C, that is, 
for any set C there exists a set U such that A € U if and only if there exists a B such that A € B 
and B e C. In particular, for any set A we can form the successor A* = A U {4A} where {A} is 
the set having the single member A. 

The process of forming successors gives a way of defining the set œ (= N) of natural 
numbers. We define 0 = ¢, 1 = ¢° = {¢}, 2 =I", ... n +l=n", ... and we define œ to be the 
union of the set of natural numbers n. The natural number n and the set œw are ordinal numbers 
in a sense that we shall now define. First, we define a set S to be transitive if A € Sand Be A 
= Be S. This is equivalent to saying that every member of S is a subset. We can now give 


DEFINITION 0.1. An ordinal is a set a that is well ordered by € and is transitive. 


The condition A € A is excluded by the axioms of set theory. We write A < B for A € a, Be 
aifAe Bor A=B. It is readily seen that every natural number n is an ordinal and the set œ of 
natural numbers is an ordinal. Also w*,(@”)*, ... are ordinals. The union of these sets is also 
an ordinal. This is denoted as œ + œ or œ x 2. 

We shall now state without proofs some of the main properties of ordinal numbers. 

Two partially ordered sets $| <, and S5, <, are said to be similar if there exists an order- 
preserving bijective map of S, onto S,. The ordinals constitute a set of representatives for the 
similarity classes of well-ordered sets. For we have the following theorem: If S, < is well 
ordered, then there exists a unique ordinal a and a unique bijective order-preserving map of S 
onto a. Ifa and p are ordinals, either a = p, a < p, or P < a. An ordinal a is called a successor 
if there exists an ordinal $ such that a = £*. Otherwise, a is called a limit ordinal. Any non- 
vacuous set of ordinals has a least element. 


DEFINITION 0.2. A cardinal number is an ordinal a such that if $ is any ordinal such that 
the cardinality |f| = lal, then a < $. 


A cardinal number is either finite or it is a limit ordinal. On the other hand, not every limit 
ordinal is a cardinal. For example œ + œw is not a cardinal. The smallest infinite cardinal is œ. 
Cardinals are often denoted by the Hebrew letter “aleph” X with a subscript. In this notation 


one writes No for œ. 

Since any set can be well ordered, there exists a uniquely determined cardinal a such that | 
a| = |S| for any given set S. We shall now call a the cardinal number or cardinality of S and 
indicate this by |S| =a. The results that we obtained in the previous section yield the following 
formulas for cardinalities 


(3) |A U B| = |B| 


if B is infinite and |B| > |A|. Here, unlike in equation (1), |S] denotes the cardinal number of the 
set S. Similarly we have 


(4) |A x B| = |B| 


if A is not vacuous and |B] is infinite and > |A]. 


).4 SETS AND CLASSES 


There is an axiomatization of set theory different from the ZF system that permits the discussion 
of collections of sets that may not themselves be sets. This is a system of axioms that is called 
the Gédel—Bernays (or GB) system. The primitive objects in this system are “classes” and 
“sets” or more precisely class variables and set variables together with a relation €. A 
characteristic feature of this system is that classes that are members of other classes are sets, 
that is, we have the axiom: Ye X => Yis a set. 

Intuitively classes may be thought of as collections of sets that are defined by certain 
properties. A part of the GB system is concerned with operations that can be performed on 
classes, corresponding to combinations of properties. A typical example is the intersection of 
classes, which is expressed by the following: For all X and Y there exists a Z such that u € Z if 
and only ifu e Xand u e Y. We have given here the intuitive meaning of the axiom: V X V Y3 
ZVu(tueZeaue Xandue Y) where V is read “for all” and J is read “there exists.” 
Another example is that for every X there exists a Y such that u e Yif and only if u ¢ X (VY X4 
YVu(ue YS~ue X), where ~... is the negation of ...). Other class formations correspond 
to unions, etc. We refer to Cohen’s book for a discussion of the ZF and the GB systems and 
their relations. We note here only that it can be shown that any theorem of ZF is a theorem of 
GB and every theorem of GB that refers only to sets is a theorem of ZF. 

In the sequel we shall use classes in considering categories and ina few other places where 
we encounter classes and then show that they are sets by showing that they can be regarded as 
members of other classes. The familiar algebraic structures (groups, rings, fields, modules, 
etc.) will always be understood to be “small,” that is, to be based on sets. 
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Categories 


In this chapter and the next one on universal algebra we consider two unifying concepts that 
permit us to study simultaneously certain aspects of a large number of mathematical structures. 
The concept we shall study in this chapter is that of category, and the related notions of functor 
and natural transformation. These were introduced in 1945 by Eilenberg and MacLane to 
provide a precise meaning to the statement that certain isomorphisms are “natural.” A typical 
example is the natural isomorphism between a finitedimensional vector space V over a field 
and its double dual V**, the space of linear functions on the space V* of linear functions on V. 
The isomorphism of V onto V** is the linear map associating with any vector xe V the 
evaluation function f ~ f(x) defined for all feV*. To describe the “naturality”’ of this 
isomorphism, Eilenberg and MacLane had to consider simultaneously all finite-dimensional 
vector spaces, the linear transformations between them, the double duals of the spaces, and the 
corresponding linear transformations between them. These considerations led to the concepts 
of categories and functors as preliminaries to defining natural transformation. We shall discuss 
a generalization of this example in detail in section 1.3. 

The concept of a category is made up of two ingredients: a class of objects and a class of 
morphisms between them. Usually the objects are sets and the morphisms are certain maps 
between them, e.g., topological spaces and continuous maps. The definition places on an equal 
footing the objects and the morphisms. The adoption of the category point of view represents a 
shift in emphasis from the usual one in which objects are primary and morphisms secondary. 
One thereby gains precision by making explicit at the outset the morphisms that are allowed 
between the objects collected to form a category. 

The language and elementary results of category theory have now pervaded a substantial 
part of mathematics. Besides the everyday use of these concepts and results, we should note 
that categorical notions are fundamental in some of the most striking new developments in 
mathematics. One of these is the extension of algebraic geometry, which originated as the study 
of solutions in the field of complex numbers of systems of polynomial equations with complex 
coefficients, to the study of such equations over an arbitrary commutative ring. The proper 


foundation of this study, due mainly to A. Grothendieck, is based on the categorical concept of 
a scheme. Another deep application of category theory is K. Morita’s equivalence theory for 
modules, which gives a new insight into the classical Wedderburn-Artin structure theorem for 
simple rings and plays an important role in the extension of a substantial part of the structure 
theory of algebras over fields to algebras over commutative rings. 

A typical example of a category is the category of groups. Here one considers “all” groups, 
and to avoid the paradoxes of set theory, the foundations need to be handled with greater care 
than is required in studying group theory itself. One way of avoiding the well-known 
difficulties is to adopt the Godel-Bernays distinction between sets and classes. We shall 
follow this approach, a brief indication of which was given in the Introduction. 

In this chapter we introduce the principal definitions of category theory— functors, natural 
transformations, products, coproducts, universals, and adjoints—and we illustrate these with 
many algebraic examples. This provides a review of a large number of algebraic concepts. We 
have included some nontrivial examples in order to add a bit of seasoning to a discussion that 
might otherwise appear too bland. 


1.1 DEFINITION AND EXAMPLES OF CATEGORIES 


DEFINITION 1.1. A category C consists of 
1. A class ob C of objects (usually denoted as A, B, C, etc.). 
2. For each ordered pair of objects (A,B) a set hom,(A,B) (or simply hom(A,B) if C is 


clear) whose elements are called morphisms with domain A and codomain B (or from 
Ato B). 


3. For each ordered triple of objects (A,B,C), a map (fg) ~gf of the product set hom (A, B) 
x hom (B, C) into hom (A, C). 
It is assumed that the objects and morphisms satisfy the following conditions: 

C1. If (4, B) # (CD), then hom (A,B) and hom(C,D) are disjoint. 

C2. (Associativity). If fe hom (4, B) gehom(B,C), and he hom(C, D), then (hg)f = h(gf). 
(As usual, we simplify this to hgf.) 

C3. (Unit). For every object A we have an element 1,€hom (A, A) such that fl, = f for 
every fe hom(4,B) and /,g = g for every g e hom (B, A). (1, is unique.) 


Iffe hom (A, B) we write f: A — B or A 4 B (sometimes A 7 B), and we call fan arrow 
from A to B. Note that gf is defined if and only if the domain of g coincides with the codomain 
of f and gf has the same domain as f and the same codomain as g. 


The fact that gf = h can be indicated by saying that 


/ 


is a commutative diagram. The meaning of more complicated diagrams is the same as for maps 
of sets (BAI, pp. 7—8). For example, the commutativity of 


A f B 
|] 
c k D 


means that gf = kh, and the associativity condition (hg)f = h(gf) is expressed by the 
commutativity of 


A D 
p<] 
B g E 


for all fe hom (4,8) and all g € hom (B,A). 

We remark that in defining a category it is not necessary at the outset that the sets hom(4,B) 
and hom (C, D) be disjoint whenever (4,8) + (C,D). This can always be arranged for a given 
class of sets hom(4,B) by replacing the given set hom (4,8) by the set of ordered triples (4,B,/) 
where f e hom (4,8). This will give us considerably greater flexibility in constructing 
examples of categories (see exercises 3—6 below). 

We shall now give a long list of examples of categories. 


EXAMPLES 


1. Set, the category of sets. Here ob Set is the class of all sets. If A and B are sets, hom (4, 


B) = B4, the set of maps from A to B. The product gf is the usual composite of maps and 1, is 
the identity map on the set A. The validity of the axioms C1, C2, and C3 is clear. 


2. Mon, the category of monoids, ob Mon is the class of monoids (BAI, p. 28), hom (M, N) 
for monoids M and N is the set of homomorphisms of M into N, gf is the composite of the 
homomorphisms g and f and 1), is the identity map on M (which is a homomorphism). The 


validity of the axioms is clear. 


3. Grp, the category of groups. The definition is exactly like example 2, with groups 
replacing monoids. 


4. Ab, the category of abelian groups, ob Ab is the class of abelian groups. Otherwise, 
everything is the same as in example 2. 


A category D is called a subcategory of the category C if ob D is a subclass of obC and for 
any A,BeobD, homp(4,B) c hom,(A,B). It is required also (as part of the definition) that 1, for 
A €ob D and the product of morphisms for D is the same as for C. The subcategory D is called 
full if homp(4,B) = hom;(A,B) for every 4,BeD. It is clear that Grp and Ab are full 


subcategories of Mon. On the other hand, since a monoid is not just a set but a triple (Zp, 1) 
where p is an associative binary composition in M and 1 is the unit, the category Mon is not a 
subcategory of Set. We shall give below an example of a subcategory that is not full (example 
10). 

We continue our list of examples. 


5. Let M be a monoid. Then M defines a category M by specifying that obM = {A}, a set 
with a single element A, and defining hom (4, A) = M, /, the unit of M, and xy for x, ye hom 


(A, A), the product of x and y as given in M. It is clear that M is a category with a single object. 
Conversely, let M be a category with a single object: obM = {A}. Then M = hom (4, A) is a 
monoid. It is clear from this that monoids can be identified with categories whose object 
classes are single-element sets. 


A category is called small if ob C is a set. Example 5 is a small category; 1—4 are not. 
An element fe hom (A, B) is called an isomorphism if there exists a géhom(B,A) such that 
fz = lp and gf = 14. It is clear that g is uniquely determined by f so we can denote it as f-t. 


This is also an isomorphism and (f)! =f If f and h are isomorphisms and fh is defined, then 
fh is an isomorphism and (fh)! = h"!f-!. In Set the isomorphisms are the bijective maps, and in 
Grp they are the usual isomorphisms (= bijective homomorphisms). 


6. Let G be a group and let this define a category G with a single object as in example 5. 
The characteristic property of this type of category is that it has a single object and all arrows ( 
= morphisms) are isomorphisms. 


7. A groupoid is a small category in which morphisms are isomorphisms. 


8. A discrete category is a category in which hom (4,B) = Ø if AFB and hom (A, A) = {1,}. 
Small discrete categories can be identified with their sets of objects. 


9. Ring, the category of (associative) rings (with unit for the multiplication composition). 
obRing is the class of rings and the morphisms are homomorphisms (mapping 1 into 1). 


10. Rng, the category of (associative) rings without unit (BAI, p. 155), homomorphisms as 
usual. Ring is a subcategory of Rng but is not a full subcategory, since there exist maps of 
rings with unit that preserve addition and multiplication but do not map 1 into 1. (Give an 
example.) 


11. R-mod, the category of left modules for a fixed ring R. (We assume 1x = x for x ina left 
R-module M.) ob R-mod is the class of left modules for R and the morphisms are JR-module 
homomorphisms. Products are composites of maps. If R = A is a division ring (in particular, a 
field), then R-mod is the category of (left) vector spaces over A. In a similar manner one 
defines mod-R as the category of right modules for the ring R. 


12. Let S be a pre-ordered set, that is, a set S equipped with a binary relation a < b such 
that a < a and a < b and b < c imply a < c. S defines a category S in which ob S = S and for 
a,beS, hom (A, B) is vacuous or consists of a single element according as a < b or a < b. Iff € 
hom (a, b) and g € hom (b, c), then gf is the unique element in hom (a, c). It is clear that the 
axioms for a category are satisfied. Conversely, any small category such that for any pair of 
objects A, B, hom (A, B) is either vacuous or a single element is the category of a pre-ordered 
set as just defined. 


13. Top, the category of topological spaces. The objects are topological spaces and the 
morphisms are continuous maps. The axioms are readily verified. 


We conclude this section by giving two general constructions of new categories from old 
ones. The first of these is analogous to the construction of the opposite of a given ring (BAI, p. 


113). Suppose C is a category; then we define C°? by obC® = obC; for 4, BeobC®, 
home op(4,B) = hom,(B,A); if fehomc,p(A.B) and g €hom,,p(B,C), then g-f (in C%?) =fg (as 
given in C). /, is as in C. It is clear that this defines a category. We call this the dual category 
of C. Pictorially we have the following: If A 4 B in C, then A £B in C® and if 


A f B 


is commutative in C, then 


is commutative in C°?. More generally, any commutative diagram in C gives rise to a 
commutative diagram in C® by reversing all of the arrows. 

Next let C and D be categories. Then we define the product category C x D by the 
following recipe: ob C x D = ob C x ob D; if A, B € ob C and 4’, B' € ob D, then hom ¢ xp 
((4 ,A’), (B, BY) = homc (4,B) x homp(4’, B’), and 1y, 4) =(14,1,); if f € homc(4,B), g € 
homc(B, C), f € homp (4’, B’), and g’ € homp(B’, C’), then 


(9.9) (41) = gf’). 


The verification that this defines a category is immediate. This construction can be generalized 
to define the product of indexed sets of categories. We leave it to the reader to carry out this 
construction. 


EXERCISES 


1. Show that the following data define a category Ring*: obRing* is the class of rings; if R 
and S are rings, then homping*(R, S) is the set of homomorphisms and anti- 
homomorphisms of R into S; gf for morphisms is the composite g following f for the 
maps f and g; and 1p is the identity map on R. 


2. By a ring with involution we mean a pair (R,j) where R is a ring (with unit) and j is an 
involution in R; that is, if j:a ~ a*, then (a + b)* = a*+ b*, (ab)* = b*a*, 1* = 1, (a*)* 
=a. (Give some examples.) By a homomorphism of a ring with involution (R,J) into a 
second one (S, k) we mean a map y of R into 5 such that 7 is a homomorphism of R into 
S (sending 1 into 1) such that ya) = kna) for all aeR. Show that the following data 
define a category Rinv: obRinv is the class of rings with involution; if (R,J) and (S,k) 
are rings with involution, then hom ((R,J), (S,4)) is the set of homomorphisms of (R,/) 
into (S,k); gf for morphisms is the composite of maps; and lg 7) = lp. 


3. Let C be a category, A an object of C. Let obC/A = UE obehom (X, A) so obC/4 is the 


class of arrows in C ending at A. If fehom/(B, A) and g e hom (C, A), define hom (fg) to 
be the set of u :B ~ C such that 


is commutative. Note that hom (f, g) and hom (/’, g’) may not be disjoint for (f, g) £ (f 
’, 2). Ifuehom(fg) and vehom(g, h) for h : D — A, then vu € hom (f, A). Use this 
information to define a product from hom (f, g) and hom (g, h) to hom (f, A). Define 
1;= 1, for f: B — A. Show that these data and the remark on page 11 can be used to 


define a category C/A called the category of objects of C over A. 
. Use C% to dualize exercise 3. This defines the category C A of objects of C below A. 


. Let C be a category, A}, A,e0b C. Show that the following data define a category C/{A), 
A} ' The objects are the triples (B, fp f2) where fje hom,(B, 4;). A morphism A : (B, fj, 
fa) ~ (C, Zp g2) is a morphism A : B — C in C such that 


is commutative. Arrange to have the hom sets disjoint as before. Define l(g f, f) = 1p 
and the product of morphisms as in C. Verify the axioms C2 and C3 for a category. 


. Dualize exercise 5 by applying the construction to C°? and interpreting in C. The 
resulting category is denoted as C\{A), A>}. 


. (Alternative definition of a groupoid.) Let G be a groupoid as defined in example 7 
above and let G = U, p-,,G""(4,B). Then G is a set equipped with a composition fg 


that is defined for some pairs of elements (f,g),f ige G, such that the following conditions 
hold: 


(i) For any f € G there exist a uniquely determined pair (u, v), u,veEG such that uf and fv 
are defined and uf = f = fv. These elements are called the left and right units 
respectively of f. 


(ii) If u is a unit (left and right for some fe G), then u is its own left unit and hence its own 
right unit. 

(iii) The product fg is defined if and only if the right unit of f coincides with the left unit 
of g. 

(iv) If fg and gh are defined, then (fg)h and f(gh) are defined and (fg)h = feh). 


(v) Iffhas left unit u and right unit v, then there exists an element g having right unit u and 
left unit v such that fg = u and gf = v. 


Show that, conversely, if G is a set equipped with a partial composition satisfying 
conditions (i)-(v), then G defines a groupoid category G in which ob G is the set of units 
of G; for any objects u, v, hom (u, v) is the subset of G of elements having u as left unit 


and v as right unit; the product composition of hom (u, v) x hom (v, w) is that given in G. 


8. Let G be as in exercise 7 and let G* be the disjoint union of G and a set {0}. Extend the 
composition in G to G* by the rules that 0a = 0 = ap for all aeG* and fg = 0 if fgeG 


and fg is not defined in G. Show that G* is a semigroup (BAI, p. 29). 


1.2 SOME BASIC CATEGORICAL CONCEPTS 


We have defined a morphism f in a category C to be an isomorphism if f : A — B and there 
exists a g :B — A such that fg = 1p and gf = 14. Iff: A — B, g : B — A, and gf = 14, then f is 
called a section of g and g is called a retraction of f. More interesting than these two concepts 
are the concepts of monic and epic that are defined by cancellation properties: A morphism f : 
A — B is called monic (epic) if it is left (right) cancellable in C; that is, if g} and g, € hom (C, 
A) (hom (B, C)) for any C and fg, = fg (gif = gf), then g} = g. The following facts are 
immediate consequences of the definitions: 

1. If A 4 B and B & C and f and g are monic (epic), then gf is monic (epic). 

2. If A 4 B and B & C and gf is monic (epic), then fis monic (g is epic). 

3. If fhas a section then f is epic, and if f has a retraction then f is monic. 

If fis a map ofa set A into a set B, then it is readily seen that f is injective (that is, f(a) # f(a 
’) for a + a'in A) if and only if for any set C and maps g,g, of C into A, fg, = fg, implies g, = 
g (exercise 3, p. 10 of BAI). Thus f € homs,{A,B) is monic if and only if fis injective. 


Similarly, f is epic in Set if and only iff is surjective (A4) = B). Similar results hold in the 
categories R-mod and Grp: We have 


PROPOSITION 1.1. 4 morphism f in R-mod or in Grp is monic (epic) if and only if the 
map of the underlying set is injective (surjective). 


Proof. Letf:A — B in R-mod or Grp. Iff is injective (surjective) as a map of sets, then it is 
left (right) cancellable in Set. It follows that fis monic (epic) in R-mod or Grp. Now suppose 
the set map f is not injective. Then C = kerf + 0 in the case of R-mod and + 1 in the case of 
Grp. Let g be the injection homomorphism of C into A (denoted by C + A), so g(x) = x for 
every x € C. Then fg is the homomorphism of C into B, sending every x e C into the identity 
element of B. Next let A be the homomorphism of C into A, sending every element of C into the 
identity element of A. Then A + g since C £ 0 (or # 1), but fg = fh. Hence fis not monic. 

Next suppose we are in the category R-mod and f is not surjective. The image f(A) is a 
submodule of B and we can form the module C = B/f(A), which is 40 since f(A) # B. Let g be 
the canonical homomorphism of B onto C and h the homomorphism of B into C, sending every 
element of B into 0. Then g + h but gf = hf. Hence fis not epic. 

Finally, suppose we are in the category Grp and f is not surjective. The foregoing argument 


will apply if C = f(A)=B (C is a normal subgroup of B). This will generally not be the case, 
although it will be so if [B :C] = 2. Hence we assume [B : C] > 2 and we shall complete the 
proof by showing that in this case there exist distinct homomorphisms g and A of B into the 
group Sym B of permutations of B such that gf = hf We let g be the homomorphism b ~ bz of B 
into Sym B where bz is the left translation x ~bx in B. We shall take A to have the form kg 


where k is an inner automorphism of Sym B. Thus k has the form y~pyp?! where y € Sym B 
and p is a fixed element of Sym B. Then A = kg will have the form b ~ pb,p! and we want this 
to be different from g -b ~ bz. This requires that the permutation p does not commute with 
every bz. Since the permutations commuting with all of the left translations are the right 


translations (exercise 1, p. 42 of BAD, we shall have h = kg + g ifp is not a right translation. 
Since translations # 1 have no fixed points, our condition will be satisfied if p is any 
permutation + 1 having a fixed point. On the other hand, to achieve the condition gf = hf we 
require that p commutes with every cz, ceC. To construct a p satisfying all of our conditions, 


we choose a permutation z of the set C\B of right cosets Cb, be B, that is not the identity and 
has a fixed point. Since |C\B| > 2, this can be done. Let 7 be a set of representatives of the right 
cosets Cb. Then every element of B can be written in one and only one way as a product cu, 
cEC, uel. We now define the map p by p(cu) = cu' where 2(Cu) = Cu'. Then peSymB, p = 1, 
and p has fixed points since z # 1 and z has fixed points. It is clear that p commutes with 
every dz, deC. Hence p satisfies all of our requirements and fis not epic. O 


What can be said about monies and epics in the category Ring? In the first place, the proof 
of Proposition 1.1 shows that injective homomorphisms are monic and surjective ones are 
epic. The next step of the argument showing that monies are injective breaks down totally, 
since the kernel of a ring homomorphism is an ideal and this may not be a ring (with unit). 
Moreover, even if it were, the injection map of the kernel is most likely not a ring 
homomorphism. We shall now give a different argument, which we shall later generalize (see 
p. 82), to show that monies in Ring are injective. 

Let f be a homomorphism of the ring A into the ring B that is not injective. Form the ring A ® 
A of pairs (a;,a2), a;€ A, with component-wise addition and multiplication and unit 1 = (1, 1). 
Let K be the subset of A ® A of elements (a),a5) such that f(a,)= f(a>). It is clear that K is a 
subring of A ® A and K F D= {(a,a)|a € A}. Let g; be the map (a), a>) ~ a; and g, the map (a, 
ay) ~ ay from K to A. These are ring homomorphisms and fg; =fg>, by the definition of K. On 
the other hand, since K # D, we have a pair (a), a,)€K with a, # a>. Then g)(ay,a5) = a, £ ay = 
g2(a,a3). Hence g; # g which shows that f is not monic. 

Now we can show by an example that epics in Ring need not be surjective. For this 
purpose we consider the injection homomorphism of the ring Z of integers into the field @ of 
rationals. If g and A are homomorphisms of @ into a ring R, then gf= hf if and only if the 
restrictions g|Z = A|Z. Since a homomorphism of @ is determined by its restriction to Z, it 
follows that gf = hf implies g = A. Thus fis epic and obviously f is not surjective. 

We have proved 


PROPOSITION 1.2. 4 morphism in Ring is monic if and only if it is injective. However, 
there exist epics in Ring that are not surjective. 

The concept of a monic can be used to define subobjects oa an object A of a category C. We 
consider the class of monicsending in A 


`` 


z A 

"g 
We introduce a preorder in the class of these monies by declaring that f < g if there exists a k 
such that f= gk. It follows that k is monic. We write f = g iff < g and g < f. In this case the 
element k is an isomorphism. The relation = is an equivalence and its equivalence classes are 
called the subobjects of A. 

By duality we obtain the concept of a quotient object of 4. Here we consider the epics 
issuing from A and define f > g if there exists a k such that f = kg. We have an equivalence 
relation f = g defined by f = kg where k is an isomorphism. The equivalence classes 
determined by this relation are called the quotient objects of A. 

If the reader will consider the special case in which C = Grp, he will convince himself that 
the foregoing terminology of subobjects and quotient objects of the object Æ is reasonable. 
However, it should be observed that the quotient objects defined in Ring constitute a larger 
class than those provided by surjective homomorphisms. 


EXERCISES 


1. Give an example in Top of a morphism that is monic and epic but does not have a 
retraction. 


2. Let G be a finite group, H a subgroup. Show that the number of permutations of G that 
commute with every h,, heH (acting in G), is [G -H]!|H|!°'“! where [G-H] is the index 
of HinG. 


1.3 FUNCTORS AND NATURAL TRANSFORMATIONS 


In this section we introduce the concept of a functor or morphism from one category to another 
nthat commute with ad the concept of maps between functors called natural transformations. 
Before proceeding to the definitions we consider an example. 

Let R be a ring and let U(R) denote the multiplicative group of units ( = invertible elements) 
of R. The map R ~ U/(R) is a map of rings into groups, that is, a map of ob Ring into ob Grp. 
Moreover, iff :R — S is a homomorphism of rings, then the restriction f |U(R) maps U(R) into 
U(S) and so may be regarded as a map of U(R) into (U(S). Evidently this is a group 


homomorphism. It is clear also that if g -S — T is a ring homomorphisM, then (gf)|U(R) = 
(g| U(S)) (| U(R)). Moreover, the restriction 1 pU(R) is the identity map on U(R). 

The map R ~ U(R) of rings into groups and f~f| U(R) of ring homomorphisms into group 
homomorphisms constitute a functor from Ring to Grp in the sense of the following definition. 


DEFINITION 1.2. Jf C and D are categories, a (covariant) functor F from C to D consists 
of 
1. A map A ~ FA of ob C into ob D. 
2. For every pair of objects (4,B) of C, a map f ~ Ff) of hom,(A, B) into homp (F4,FB). 
We require that these satisfy the following conditions: 
FI. If gf is defined in C, then F(gf) = F(g)F(/). 
F2. F(1,) = ly. 
The condition F1 states that any commutative triangle 


B 
f 


in C is mapped into a commutative triangle in D 


Fip 


FA Fig) 


F(h) 


A contravariant functor from C to D is a functor from C°? to D. More directly, this is a 
map F of ob C into ob D and for each pair (A, B) of objects in C, a map F of hom (4,B) into 
hom (FB,FA) such that F(fg) = F(&)F(f) and F(1,) = lz4. A functor from B x C to D is called a 
bifunctor from B and C into D. We can also combine bifunctors with contravariant functors to 
obtain functors from B x C to D and from B® x C% to D. The first is called a bifunctor that 
is contravariant in B and covariant in C and the second is a bifunctor that is contravariant in 
B and C. 


EXAMPLES 
1. Let D be a subcategory of the category C. Then we have the injection functor of D into C 
that maps every object of D into the same object of C and maps any morphism in D into the 


same morphism in C. The special case in which D = C is called the identity functor |,. 


2. We obtain a functor from Grp to Set by mapping any group into the underlying set of the 


group and mapping any group homomorphism into the corresponding set map. The type of 
functor that discards some of the given structure is called a “forgetful” functor. Two other 
examples of forgetful functors are given in the next example. 


3. Associated with any ring (R, + ,-,0, 1) we have the additive group (R, + ,0) and the 
multiplicative monoid (R,., 1). A ring homomorphism is in particular a homomorphism of the 
additive group and of the multiplicative monoid. These observations lead in an obvious 
manner to definitions of the forgetful functors from Ring to Ab and from Ring to Mon. 


4. Let n be a positive integer. For any ring R we can form the ring M„(R) ofn x n matrices 
with entries in R. A ring homomorphism f: R — S determines a homomorphism (rj) ~ (f(r;)) of 
M,,(R) into M,(S). In this way we obtain a functor M, of Ring into Ring. 


5. Let n and R be as in example 4 and let GL,,(R) denote the group of units of M,,(R), that is, 
the group of n x n invertible matrices with entries in R. The maps R ~ GL,(R),f into (7) ~ 
(f(r;;)) define a functor GL, from Ring to Grp. 


6. We define the power functor # in Set by mapping any set A into its power set @(A) and 
any set map f:A — B into the induced map fa of P(A) into P(B) which sends any subset A, of A 


into its image (4) c B(O ~ 8). 


7. The abelianizing functor from Grp to Ab. Here we map any group G into the abelian 
group G/(G, G) where (G, G) is the commutator group of G (BAI, p. 238). If f is a 
homomorphism of G into a second group H,f maps (G, G) into (H,H) and so induces a 
homomorphism / of G/(G, G) into H/(H,H). The map f ~ f completes the definition of the 
abelianizing functor. 


8. Let Poset be the category of partially ordered sets. Its objects are partially ordered sets 
(BAI, p. 456) and the morphisms are order-preserving maps. We obtain a functor from R-mod 
to Poset by mapping any R-module M into L(M), the set of submodules of M ordered by 
inclusion. If f. M — N is a module homomorphisM, f determines an order-preserving map of 
L(M) into L(N). These maps define a functor. 


9. We define a projection functor of C x D into C by mapping any object (4, B) of C x D 
into the object A of C and mapping (fg)e hom ((A, B) (A'B’)) into fe hom (A, A’). 


10. We define the diagonal functor C — C x C by mapping A ~ (A, A) and f:A — B into 
(A,A)42.(B,B). 


11. Consider the categories R-mod and mod-F of left R-modules and right R-modules 
respectively for the ring R. We shall define a contravariant functor D from R-mod to mod-R as 
follows. If M is a left R-module, we consider the set M* = homg(M,R) of homomorphisms of 


M into R regarded as left R-module in the usual way. Thus M* is the set of maps of M into R 


such that 
I(x+y) =f) +f) 
Jf (rx) = rf (x) 
forx,ye M,r € R. Iff, ge M* ands e R, then f + g defined by (f+ 2) (x) = f(x) + g(x) and 
fs defined by (fs) (x) = f(x)s are in M*. In this way M* becomes a right R-module and we have 


the map M ~ M* of obj R-mod into obmod-R. Now let L : M — N be a homomorphism of the 
R-module M into the R-module N. We have the transposed map L* :N* ° M* defined as 


[*:9 ~ gL, 


the composite of g and L: 


If m =m, “m, in R-mod and g € M;*, then (LL )*(8) = gLloL, = (gLly)L, = L*,*L,*(g). Hence 
(L,L,)* = LL. 
It is clear that (1,4)* = 14x. It follows that 
D:M ~> M*, Lsp 


defines a contravariant functor, the duality functor D from R-mod to mod-R. In a similar 
fashion one obtains the duality functor D from mod-R to R-mod. 


It is clear that a functor maps an isomorphism into an isomorphism: If we have f, = 1p, gf = 
1,, then F()F(g) = lpg and F(g)F(/) = 1py. Similarly, sections are mapped into sections and 
retractions are mapped into retractions by functors. On the other hand, it is easy to give 
examples to show that monies (epics) need not be mapped into monies (epics) by functors (see 
exercise 3 below). 

If F is a functor from C to D and G is a functor from D to E, we obtain a functor GF from C 
to E by defining (GF)A = G(F4) for ,Ac obC and (GF) (£) = G(F()) for fehom,(4,B). In a 
similar manner we can define composites of functors one or both of which are contravariant. 
Then FG is contravariant if one of E G is contravariant and the other is covariant, and FG is 
covariant if both F and G are contravariant. Example 5 above can be described as the 
composite UM, where M, is the functor defined in example 4 and U is the functor from Ring 
to Grp defined at the beginning of the section. As we shall see in a moment, the double dual 
functor D? from R-mod to itself is a particularly interesting covariant functor. 

A functor F is called faithful (full) if for every pair of objects A, B in C the map f ~ F(f) of 


hom,(A, B) into homp(FA, FB) is injective (surjective). In the foregoing list of examples, 
example 1 is faithful and is full if and only if D is a full subcategory of C; examples 2 and 3 
are faithful but not full (why?); and example 9 is full but not faithful. 

We shall define next the important concept of natural transformation between functors. 
However, before we proceed to the definition, it will be illuminating to examine in detail the 
example mentioned briefly in the introduction to this chapter. We shall consider the more 
general situation of modules. Accordingly, we begin with the category R-mod for a ring R and 
the double dual functor D? in this category. This maps a left R-module M into M** = (M*)* and 
a homomorphism L :M — N into L** = (L*)* :M** > N**, Ifx e M, g e N*, then L*g e M* 
and (L*g)(x) = g(Lx). If © e M**, L** e N** and (L**®) (g) = O(L*g). We now consider 
the map 


N(x) if > f(x) 


of M* into R. This is contained in M** = hompy(M*,R) and the map yųy:x ~ nyx) is an R- 
homomorphism of M into M**. Now for any homomorphism L:M ~ N, the diagram 


M NM M** 
(1) L L** 
N Ny N** 


is commutative, because if x € M, then 7)(Lx) is the map g ~ g(Lx) of N* into R and for ọ = 
nyx) E M**, (L**—) (g) = (p(L*g). Hence (L**ny(x)) (8) = nM(x) (L*g) = (L*g) (x) = g(Lx). 
We now introduce the following definition of “naturality.” 


DEFINITION 1.3. Let fand G be functors from C to D. We define a natural transformation 7 
from f to G to be a map that assigns to every object A in C a morphism n ,€homp(FA, GA) 


such that for any objects A,B of C and any fehom,(A,B) the rectangle in 
A FA ™ GA 
B FB Tja GB 


is commutative. Moreover, if every ņ4 is an isomorphism then n is called a natural 
isomorphism. 
In the foregoing example we consider the identity functor 7 R_mog and the double dual functor 


D? on the category of left R-modules. For each object M of R-mod we can associate the 
morphism 4, of M into M**. The commutativity of (1) shows that 7 :M ~ yj, is a natural 
transformation of the identity functor into the double dual functor. 

We can state a stronger result if we specialize to finite dimensional vector spaces over a 
division ring 4. These form a subcategory of 4-mod. If V is a finite dimensional vector space 
over A, we can choose a base (€),é3,...,e,,) for V over å. Let e;* be the linear function on V 


such that e;*(ej) =ô; Then (e;*, e2*, ... €„*,) is a base for V* as right vector space over A—the 
dual (or complementary) base to the base (e)...,¢,). This shows that V* has the same 


dimensionality n as V. Hence V** has the same dimensionality as V. Since any non-zero vector 
x can be taken as the element e; of the base (e;, e>, ..., e,), it is clear that for any x + 0 in V 


there exists a g e V* such that g(x) # 0. It follows that for any x # 0 the map 7,(x) :f ~ f(x) is 
non-zero. Hence 7,,:x ~ n,(“) is an injective linear map of V into V**. Since dim V** = dim V, 
it follows that y, is an isomorphism. Thus, in this case, 7 is a natural isomorphism of the 


identity functor on the category of finite dimensional vector spaces over 4 onto the double dual 
functor on this category. 

We shall encounter many examples of natural transformations as we proceed in our 
discussion. For this reason it may be adequate to record at this point only two additional 
examples. 


EXAMPLES 


1. We define the functor ®, in R-mod by mapping any module M into M” the direct sum of 
n copies of M (BAI, p. 175), and any homomorphism f: M — N into 


{” (dy, sae 1M,) oy (f(a, Lam „f (a,)). 
For any M we define the diagonal homomorphism 
Oy 1a (a,...,a). 


Then 5):M +, is a natural transformation of lp mog into ®, since we have the 
commutative diagram 


M by" M 
f Í 
N 5," N 


2. We consider the abelianizing functor as defined in example 7 above, but we now regard 
this as a functor from Grp to Grp rather than from Grp to Ab. (This is analogous to changing 
the codomain of a function.) Let vg denote the canonical homomorphism of G onto the factor 


group. Then we have the commutative diagram 


G Ve GAG,G) 


H = HAH.H) 


which shows that v:G ~ vg is a natural transformation of the identity functor of Grp to Grp to 
the abelianizing functor. 


Let FE G, H be functors from C to D, 7 a natural transformation of F to G, and ¢ a natural 
transformation from G to H. If A € obC then yn, € homp(F4,GA) and ¢, € homp(G4,HA). 


Hence ¢4y4 E€ homp(/4,HA). We have the commutativity of the two smaller rectangles in 


FA u GA U HA 
Fi) Gif) Hip) 
FB Tja GB & HB 


which implies the commutativity of the large rectangle with vertices F4, HA, FB, HB. This 
implies that A ~ Cym; is a natural transformation from F to H. We call this G,, the product of ¢ 


and 7. 

if F is a functor from C to D, we obtain a natural transformation 1, of F into itself by 
mapping any, A € ob C into l; E homp(FA, FA). If 7 is any natural transformation from F to a 
functor G from C to D, then we evidently have 71; = 7 = 1 Gn. 

Let 7 be a natural isomorphism of F to G. Then 7, is an isomorphism 7, : FA — GA for 
every, A € ob C. Hence we have the isomorphism y7! : GA — FA. The required commutativity 
is clear, so A > 7, is a natural isomorphism of G to F. We call this the inverse y! of n (714 
=n). It is clear that we have min = lp and yy! = lç. Conversely, if 7 is a natural 
transformation from F to G and 7 is one from G to F such that ¢ y =1¢ 1p and y¢= 1g, then 7 is 
a natural isomorphism with q~! = ¢. 

If 7 is a natural isomorphism of a functor E of C to C with the functor |,, then the 
commutativity of 


A Ma EA 
f Ef) 
B Tia EB 


shows that E() = ngf 4, which implies that the map f ~ E(f) of hom (A, B) into hom (EA, EB) 


is bijective. 


EXERCISES 


1. Let F be a functor from C to D that is faithful and full and let fehom,(A,B). Show that any 


one of the following properties of F(f) implies the same property for f-F(f/) is monic, is 
epic, has a section, has a retraction, is an isomorphism. 


2.Let M and N be monoids regarded as categories with a single object as in example 5, p. 
12. Show that in this identification, a functor is a homomorphism of M into N and that a 
natural transformation of a functor F to a functor G corresponds to an element bE N such 
that b(Fx) = (Gx)b, xeM. 


3. Use exercise 2 to construct a functor F and a monic (epic) f such that F(f) is not monic 
(epic). 

4. Let G be a group, G the one object category determined by G as in example 6 on p. 12. 
Show that a functor from G to Set is the same thing as a homomorphism of G into the 
group Sym S of permutations of a set S, or, equivalently, an action of G on S (BAI, p. 
72). Show that two such functors are naturally isomorphic if and only if the actions of G 
are equivalent (BAI, p. 74). 


5. Let B, C, D, E be categories, F and G functors from C to D, K a functor from B to C, and 
H a functor from D to E. Show that if 7 is a natural transformation from F to G, then A ~ 
Hn, is a natural transformation from HF to HG for A €obC and B ~7xpz is a natural 


transformation from FK to GK for Be obB. 


6. Define the center of a category C to be the class of natural transformations of the identity 
functor 1ç to lç. Let C= R-mod and let C be an element of the center of R. For any M e€ 


ob R-mod let 7,,(c) denote the map x ~ cx, x e M. Show that 7(c) :M ~ n,/(°) is in the 


center of C and every element of the center of C has this form. Show that c ~ n(c) is a 
bijection and hence that the center of R-mod is a set. 


1.4 EQUIVALENCE OF CATEGORIES 


We say that the categories C and D are isomorphic if there exist functors F :C — D (from C to 
D) and G :D — C such that GF = 1c and FG = 1p. This condition is rather strong, so that in 


most cases in which it holds one tends to identify the isomorphic categories. Here is an 
example. Let C= Ab and D = 2-mod. If M is an abelian group (written additively), M becomes 
a Z-module by defining nx for n € Z, x € M, as the nth multiple of x (BAI, p. 164). On the other 
hand, if M is a 2-module, then the additive group of M is an abelian group. In this way we have 
maps of ob Ab into ob Zmod and of ob Z-mod into ob Ab that are inverses. If f is a 
homomorphism of the abelian group M into the abelian group N, then f(mx) = nf (x), n € Z, x 


e M. Hence fis a homomorphism of M as Z2-module into N as Z-module. Conversely, any Z- 
homomorphism is a group homomorphism. It is clear from this that Ab and 2Z-mod are 
isomorphic categories, and one usually identifies these two categories. 

Another example of isomorphic categories are R-mod and mod-R°? for any ring R. If M is a 
left R-module, M becomes a right R°P-module by defining xr = rx for x e M, r e R? = R (as 
sets). Similarly, any right R°P-module becomes a left R-module by reversing this process. It is 
clear also that a homomorphism of pM, M as left R-module, into pW is a homomorphism of 
Mpop, M as right R°?-module, into Nop. We have the obvious functors F and G such that gf = 
1R-mog and FG = loa go”. Hence the two categories are 


The concept of isomorphism of categories is somewhat too restrictive; a considerably more 
interesting notion is obtained by broadening this in the following manner. We define C and D to 
be equivalent categories if there exist functors F:C — D and G:D — C such that GF = |, and 
FG = lp where = denotes the natural isomorphism of functors. Evidently isomorphism of 
categories implies equivalence. It is clear also that the relation of equivalence between 
categories is what the name suggests: it is reflexive, symmetric, and transitive. 

We note that the functor G in the definition of equivalence is not uniquely determined by F. 
It is therefore natural to shift the emphasis from the pair (F, G) to the functor f and to seek 
conditions on a functor f :C — D in order that there exists a G:D — C such that (F, G) gives an 
equivalence, that is, GF = 1¢ and FG = |p. We have seen that GF = |, implies that the map f~ 
GF(f) of homC(4,B) onto hom,(GFA,GFB) is bijective. Similarly, g ~ FG(g) is bijective of 
homp(A',B') onto homp(FGA',FGB'). Now the injectivity of f ~ GF(f) implies the injectivity 
of the map f~ Ff) of the set hom¢(A,B) into the set homD(FA4,FB) and the surjectivity of g ~ 
FG(g) implies the surjectivity of f ~ Ff). Thus we see that the functor F is faithful and full. 
We note also that given any , A’ € obD, the natural isomorphism FG = lp gives an 
isomorphism 74, € homp(4’, FGA’). Thus if we put A = GA’ e obC, then there is an 
isomorphism contained in homp(4’, FA) or, equivalently, in homp(F4,/’). 

We shall now show that the conditions we have sorted out are also sufficient and thus we 
have the following important criterion. 


PROPOSITION 1.3. Let F be a functor from C to D. Then there exists a functor G :D —> C 
such that (i, G) is an equivalence if and only if F is faithful and full and for every object A’ 
of D there exists an object A of C such that FA and A' are isomorphic in D, that is, there is an 
isomorphism contained in hom p(F4,A4’). 


Proof. It remains to prove the sufficiency of the conditions. Suppose these hold. Then for any 
A € ob D we choose ,A € ob C such that FA and A’ are isomorphic and we choose an 
isomorphism ņa, : A’ — FA. We define a map G of ob D into ob C by A’ ~ A where A is as just 
chosen. Then y4 : A’ — FGA’. Let B' be a second object of D and let f € homp (4’, B’). 


Consider the diagram 


A’ Tu! FGA' 

{ 

B' men FGB’ 
Since y4: is an isomorphism, we have a unique morphism gf; !: FGA' > FGB' making a 
commutative rectangle. Since F is full and faithful, there is a unique fGA — GB in C such that 


Ff) = nf'n, | We define the map G from homp(4-B’) to home(GA, GB) by f ~ f. Then we 
have the commutative rectangle 


i L 


and G(f’) is the only morphism GA' — GB’ such that (2) is commutative. 
Now let g' € homp(B'", C’). Then we have the diagram 


FG") 


FOr’) 


in which the two small rectangles and hence the large one are commutative. Since F is a 
functor, we have the commutative rectangle 


; = 7 


On the other hand, we have the commutative rectangle 


A' Na’ FGA' 
(4) sv F(G(g'f')) 


E Ne FGC' 
and G(g'f'): GA’ GC’ is the only morphism for which (4) is commutative. Hence we have G(g 
NG) = G(g' f). Ina similar manner we see that G(/ 4) = 1¢4. Thus the maps G : A’ ~ GA', G: 
f ~ G(f) for all of the hom sets homp(4’, B") constitute a functor from D to C. Moreover, the 
commutativity of (2) shows that 7’: A' ~ ny, is a natural isomorphism of 1p to FG. 
We observe next that since F is faithful and full, if 4 ,B € ob C and f : FA — FB is an 


isomorphism, then the morphism f : A — B such that F(f/)=/" is an isomorphism (exercise 1, p. 
25). It follows that since ņg4 : FA — FGFA is an isomorphism, there exists a unique 


isomorphism C, : A — GFA such that F(C,) = nzy. The commutativity of (2) for A’ = FA, B' = 
FB, and f' = F(f) where f : A— B in C implies that 


PA Fite) FGFA 
Fi) FGFIP 
FB Fa) FGFB 


is commutative. Since F is faithful, this implies that 


A ù GFA 
Í GRN 
B D GFB 


is commutative. Hence ¢ : A — C, is a natural isomorphism of 1, into GF. O 


As an illustration of this criterion we prove the following very interesting proposition. 


PROPOSITION 1.4. Let R be a ring, and M,,(R) the ring of nxn matrices with entries in R. 
Then the categories mod-R and mod-M,,(R) of right modules over R and M,(R) respectively 
are equivalent. 


Before proceeding to the proof we recall some elementary facts about matrix units in M,(R) 
(BAI, pp. 94-95). For i, j = 1,..., n, we define e; to be the matrix whose (i,j)-entry is 1 and 


other entries are 0, and for any aeR we let a’ denote diag {a,... ,a}, the diagonal matrix in 
which all diagonal entries are a. 


Then we have the multiplication table 


(5) eijêk = Ò kei 
and 

Moreover, 

(7) ae; = eja 


and this matrix has a in the (i,/)-position and 0’s elsewhere. Hence if 


Aii Gy2 ° * * Ain 
Gy; Gy * * * Gy 
(8) A= ; a;,€ R 
Qn} an2 Aan 
then 
— ! — / 


We can now give the 


Proof of Proposition 1.4. Let M be a right R-module and let M” be a direct sum of n copies 
of M (BAL p. 175). If x = (x4, X2, ..., Xp) E€ M™ and A e€ M,(R) as in (8), we define xA to be 
the matrix product 


Gy; Gy. ° * * Ain 
A21 G22 ° ° * An 

(10) uia) me Wary a 
ni n2 Ann 

where 

(11) y= LX A ji, l<i<n 


and the right-hand side is as calculated in M. Using the associativity of matrix multiplication 
(in this mixed case of multiplication of “vectors” by matrices) we can verify that M”) is a right 
M_,(R)-module under the action we have defined. Thus we have a map M ~ M”) of ob mod-R 
into ob mod-M,(R). If f is a module homomorphism of M into N, then the diagonal 
homomorphism /f) : (x), x5, ..., Xp) > FŒ), fx, «--» flx,)) is a homomorphism of the right 
M,(R)-module M” intotheright M,,(R)-module M”), The maps M ~ M”, f > f constitute a 
functor F from mod-R to mod-M,(R). We shall verify that F satisfies the conditions of 
Proposition 1.3. 

1. F is faithful. Clear. 

2. F is full: Let g be an M,(R)-homomorphism of M” into N™® where M and N are right R- 
modules. Now M%e,, is the set of elements (x, 0, ..., 0), x € M, and Ne), is the set of 
elements (y, 0, ..., 0), y € N. Since g is an M,(R)-homomorphism, g(Me,,) c Me,,. Hence 
g(x, 0, ..., 0) = (f(x), 0, ..., 0). Itis clear that fis additive, and f(xa) = f(x)a for a e R follows 
from g((x, 0, ..., 0)a’) = (g(x, 0, ..., 0))a’. Hence f is an R-homomorphism of M into N. Now 
(x, 0, ..., Oe}; = (0, ..., 0, x, 0, ..., 0), so g((x, 0, ..., Oey) = (g(x, 0, ..., 0))e;; implies that 
2(0, ..., 0, x, 0, ..., 0) =(0, ..., 0, Ax), 0,..., 0). Then g =f and F is full. 

3. Any right 7,(R)-module M’ is isomorphic to a module FM, M a right R-module: The 
map a ~ a' is a homomorphism of R into M,,(R). Combining this with the action of M,„(R) on M 
' we make M’ a right R-module in which x'a = x'a’ x'e M'. Then M = M'e,, is an R- 
submodule of M’ since e,,a' = a'e;;, a€R. Moreover, x'e;; = xe;;,e;;€M for any i. We define 
a map ny, :M'— FM = M” by 


(12) X > (X01 1, XC a 150009 % Cnt): 
Direct verification using (9) and the definition of x' a shows that my is an M,(R)- 
homomorphism. If x'e;; =0 for 1 < i < n, then x' = x'e; = )x'ee;, = 0. Hence yy is injective. 
Moreover, yy’, is surjective: If (x,,X5,....X,)€ M, then x; = xe), = (xe),)e, and 
(Xis X25 +3 Xn) = (ONC Oras (X1211)E215 -3 Mer en) 
+((x2212)e1 1, (X2212)}e215 + +3 (X21 2m) + 


Thus yw is an isomorphism. O 


EXERCISES 


1. Let (F, G) be an equivalence of C into D and let f € hom, (4, B). Show that any one of the 
following properties of f implies the same property for F(f) : f is monic, is epic, has a 
section, has a retraction, is an isomorphism. 


2. Are mod-R and mod-M (R) isomorphic for n > 1? 


1.5 PRODUCTS AND COPRODUCTS 


There are many basic constructions in mathematics and especially in algebra—such as the 
(Cartesian) product of sets, the direct product of groups, the disjoint union of sets, and the free 
product of groups—that have simple characterizations by means of properties of maps. This 
fact, which has been known for some time, can be incorporated into category theory in the 
definition and examples of products and coproducts in categories. We shall begin with an 
example: the direct product of two groups. 

Let G = G; X G,, the direct product of the groups G; and G, :G is the group of pairs 
(€122), Z; © G; with the multiplication defined by 


(91592) (hi, h2) = (9yhy, 92h), 
the unit 1 = (1,, 1,) where 1, is the unit of G, and (g), g2)! = (g, 1, g> |). We have the 
projections p; : G — G, defined by 
(13) Pigg) >g P22(G1s92) > 92. 


These are homomorphisms, since P1,((g1.22)(/1,42)) = Pi(gyhy.g2h2) = gh, = (P1162) 
(Pi(hi,h2)) and p,(1) = 1, Similar relations hold for pz. 

Now let H be another group and let f; :H — G; be a homomorphism of H into G;. Then we 
define a map f of H into G = G; x G, by 


h ~ (filh), fa(h)). 
It is clear that this is a homomorphism and P,f(h) = f(A). Hence we have the commutativity of 


H f G 


Pi 
(14) fi 


Gi 


Next let f be any homomorphism of H into G such that pf” =f, i = 1, 2. Then f(h) = 
(f,(h),f(h)) =/(h) so f =f Thus fis the only homomorphism H — G making (14) commutative. 


We now formulate the following 


DEFINITION 1.4. Let A; and A, be objects of a category C. A product of A; and A, in C is 
a triple (A,p,,p2) where A € ob C and P; € homc(4,4;) such that if B is any object in C and f; 
€e hom,(B, A,), i = 1, 2, then there exists a unique f€ hom(,(B,A) such that the diagrams 


B f A 


(15) Pi 


are commutative. 


It is clear from our discussion that if G, and G, are groups, then (G = G, x G),P),P>) is a 
product of G; and G, in the category of groups. The fact that (G, x G2,P1.p2) is a product of G} 
and G, in Grp constitutes a characterization of the direct product G; x G, and the projections 
p;. This is a special case of 


PROPOSITION 1.5. Let (4,p;,p,) and (A'p'ıp')) be products of A; and A, in C. Then there 
exists a unique isomorphism h:A—A' such that P, = p';h, i= 1, 2. 


Proof. If we use the fact'that (4', p', p2) is a product of A, and A5, we obtain a unique 
homomorphism A : A — A'such that p; = p';h, i= 1, 2. Reversing the roles of (A, p, p2) and (A', 
P's p'2) we obtain a unique homomorphism h’ : A’ — A such that p’; = p,h'. We now have p; = 
p;h'h and p'; = p;hh'. On the other hand, if we apply definition 1.4 to B = A and f; = p;, we see 
that 74 is the only homomorphism 4 — A such that p; = p;14. Hence h'h = l} and similarly hh’ 
= ] ,. Thus A is an isomorphism and A'= h+. O 


Because of the essential uniqueness of the product we shall denote any product of A; and A, 
in the category C by A)[ h. The concept of product in a category can be generalized to more 
than two objects. 


DEFINITION 1.4’. Let {A,|a € I} be an indexed set of objects in a category C. We define a 
product | JA, of the A, to be a set {A,p,|ac1} where A € ob C, p, E hom,(4,A4,) such that if B 
€ ob C and f, E hom((B, Aa), a € J, then there exists a unique f€ hom,(B, A) such that every 
diagram 


is commutative. 
We do not assume that the map a ~ A, is injective. In fact, we may have all the A, equal. 


Also, if a product exists, then it is unique in the sense defined in Proposition 1.5. The proof of 
Proposition 1.5 carries over to the general case without change. We now consider some 
examples of products in categories. 


EXAMPLES 


1. Let {A,|a € 7} be an indexed set of sets. We have the product set A = []A4,, which is the 
set of maps a: J — UA, such that for every a € I, a(a) € A,. For each a we have the projection 
Paa a(a). We claim that {4,p,,' is a product of the 4, in the category Set. To see this, let B be a 
set and for each a €e / let f, be a map: B —> A,. Then we have the map f: B — A such that f(b) 
is a — f,). Then pf = fọ and it is clear that fis the only map: B — A satisfying this condition. 
Hence {A, pay satisfies the condition for a product of the A,. 


2. Let {G la € I} be an indexed set of groups. We define a product in G=[]G, by gga) = 
g(a)g'(a) for g, g' €G and we let 1 €G be defined by 1(a) = lẹ, the unit of G, for a € Z. It is 


easy to verify that this defines a group structure on G and it is clear that the projections as 
defined in Set are homomorphisms. As in example 1, {G, p,) is a product of the G, in the 


category of groups. 


3. The argument expressed in example 2 applies also in the category of rings. If {R,|a €Z} 
is an indexed set of rings, we can endow | [R, with a ring structure such that the projections p} 
become ring homomorphisms. Then {[]R,,p,} is a product of the R, in the category Ring. 


4. Ina similar manner we can define products of indexed sets of modules in R-mod for any 
ring R. 


We now consider the dual of the concept of a product. This is given in the following 


DEFINITION 1.5. Let {A, | QEZ} be an indexed set of objects of a category C. We define a 


coproduct A, to be a set {A, i,|a € I} where AE ob C and i,ehom (A,, A) such that if B € ob 
C and g, E€ hom,(A,,B), a E1, then there exists a unique G ehom (A, B) such that every 
diagram 


(16) in 


is commulative. 


It is readily seen that if {4, ila € J} and {4’ i'a € Z} are coproducts in C of {4,|ael}?, 
then there exists a unique isomorphism k :A’— A such that i, = ki’, for all a € J. 

If {4,la €I} is an indexed set of sets, then there exists a set UA, that is a disjoint union of 
the sets 4,. Let i, denote the injection map of A, into UA,. Let B be a set and suppose for each 
a we have a map g, of A, into B. Then there exists a unique map g of UA, into B such that the 
restriction g|A, = g,, a € I. It follows that UA, ia} is a coproduct of the A, in Set. 

We shall show in the next chapter that coproducts exist for any indexed set of objects in 
Grp or in Ring (see p. 84). In the case of R-mod this is easy to see: Let {M,|a € I} be an 
indexed set of left R-modules for the ring R and let | JM, be the product set of the M, endowed 
with the left R-module structure in which for x, y e [] M, and r € R, (x + y) (a) = x(a) + y(a), 
(rx) (a) = r(x(a)). Let ® M, be the subset of ®[ [M, consisting of the x such that x(a) = 0 for all 
but a finite number of the a e 7. Clearly ®M, is a submodule of | [M;. Ifx, E M,, we let ix, 
be the element of ƏM, that has the value x, at a and the value 0 at every B + a, p eI. The map 
ig © Xa  igXq 18 a module homomorphism of M, into ®M,. Now let N € R-mod and suppose 
for every a € I we have a homomorphism g , :M, — N. Let xe;@M,. Then x(a) = 0 for all but 
a finite number of the a; hence } g, ,x(a) is well defined. We define g as the map * ~ >’g,x(°) of 
ƏM, into N. It is readily verified that this is a homomorphism of ƏM, into N and it is the 
unique homomorphism of ®M, into N such that gi, = g,. Thus {®M,, 1,} is a coproduct in R- 
mod of the M,. We call @M, the direct sum of the modules M,. 

Since the category Ab is isomorphic to Z-mod, coproducts of arbitrary indexed sets of 
objects in Ab exist. 


EXERCISES 
1. Let S be a partially ordered set and S the associated category as defined in example 12 


onp. 13. Let {a,|a € I} be an indexed set of elements of S. Give a condition on {a} that 


the corresponding set ol objects in S has a product (coproduct). Use this to construct an 
example of a category in which every finite subset of objects has a product (coproduct) 
but in which there are infinite sets of objects that do not have a product (coproduct). 


. A category C is called a category with a product (coproduct) if any pair of objects in C 
has a product (coproduct) in C. Show that if C is a category with a product (coproduct), 
then any finitely indexed set of objects in C has a product (coproduct). 


. An object A of a category C is called initial (terminal) if for every object X of C, 
hom,(A, X) (hom,(X, A)) consists of a single element. An object that is both initial and 
terminal is called a zero of C. Show that if A and A’ are initial (terminal), then there 
exists a unique isomorphism A in hom,(A, A’). 

. Let A; and A, be objects of a category C and let C/{A,, A,} be the category defined in 
exercise 5 of p. 14. Show that A, and A, have a product in C if and only if C/{A,, A} 
has a terminal object. Note that this and exercise 3 give an alternative proof of 
Proposition 1.5. Generalize to indexed sets of objects in C. 


. Use exercise 6 on p. 15 to give an alternative definition of a coproduct of objects of a 
category. 

. Let f; : A; — B, i = 1, 2, ina category C. Define a pullback diagram of {f;,f,} to be a 
commutative diagram 


C 81 A, 
17 
(17) n f 
A» f B 
such that if 
D hy Ay 
h fi 
A; h B 


is any commutative rectangle containing f; and fọ, then there exists a unique k :D — C 
such that 


is commutative. Show that if (C, g;, g2) and (C’, g'}, g'2) determine pullbacks fı and fọ as 
in (17), then there exists a unique isomorphism k : C’— C such that g'; = gjk, i = 1, 2. 


7. Let f: G; — Hin Grp. Form G; x G, and let M be the subset of G; x G, of elements (a, 
ay) such that f(a) = f>(a>). This is a subgroup. Let m; = p;|M where p; is the projection 
of G; x G, on G;. Show that {m,, m} defines a pullback diagram of f, and f. 


8. Dualize exercise 6 to define a pushout diagram determined by f; : B > A; i = 1, 2, in C. 
Let f,;B — A, i = 1, 2, in R-mod. Form 4;@A). Define the map f-b ~ (;(b),f,(b)) of B 
into A; ® 4. Let I = Imfand put N = (4, ® A,)/I. Define n;:A;-N by nja; = (a;,0) +I, 
na = (0,a,) + I. Verify that {n),n>} defines a pushout diagram for f; and f>. 


1.6 THE HOM FUNCTORS. REPRESENTABLE FUNCTORS 


We shall now define certain important functors from a category C and the related categories 
C? and C°P x C to the category of sets. We consider first the functor hom from C°? x C. We 
recall that the objects of C°P x C are the pairs (4, B), A, Beob C, and a morphism in this 
category from (A, B) to (A’, B’) is a pair (f, g) where f: A' — A and g : B —> B'. If (f, gù) isa 
morphism in C” x C from (4', B’) to (4”, B”),so f : A" > A’, g' : B' — B", then (f, g) f2) = 
(ff, g'g). Also la, 3) = (la 1p). 

We now define the functor hom from C°? x C to Set by specifying that this maps the object 
(A, B) into the set hom (4,B) (which is an object of Set) and the morphism (f; g) : (A, B) —> (A’. 
B’) into the map of hom (A, B) into hom (4’,B’) defined by 


(18) hom (f,g):k > gkf. 


This makes sense since f: A' — A, g : B — B', k : A —> B, so gkf: A'— B'. These rules define 
a functor, since if (f, g): (4", B") — (A', B’), then (f, 214 g) = (ff, g'g) and 


hom (ff',g'g) (k) = (g'g)k(U ff’) 
= g'(gkf f" 
= g hom (f,g) (k)f’ 


= hom (f’,g’) (hom ( f,g) (k)). 


Thus hom (f,g') (£g)) = hom (f,g') hom (f,g). Moreover, if = 14 and g= 1p, then (18) shows 
that hom (1,4, Zp) is the identity map on the set hom (4, B). Thus the defining conditions for a 


functor from C? x C to Set are satisfied. 
We now fix an object A in C and we define a functor hom (4, —) from C to Set by the rules 


hom (A, —)B = hom (4, B) 


hom (A, — ) (g) for g:B > B’ is the map 


(19) hom (A,g):k > gk 


of hom(A, B) into hom(4,B’'). It is clear that this defines a functor. We call this functor the 
(covariant) hom functor determined by the object AinC. 


In a similar manner we define the contravariant hom functor hom (— ,B) determined by B 
€ ob C by 


hom (—,B)A = hom (4A, B) 
hom (—,B)(f) for f: A’ A is the map 


(20) hom (f,B):k ~ kf 

of hom (A, B) into hom (4', B). Now let f: A'— A, g : B — B' k : A — B. Then 
hom (f, B') hom (A, g) (k) = (gk)f 
hom (A’,g) hom (f, B) (k) = g(kf) 

and (sk) f= g(kf) = hom (f, g) (k). Hence 


hom (A,B) hom (A,g) hom (A,B’) 


(21) hom (f,B) hom (f,B’) 


hom (A’,B) hom (4’,g) hom (A‘',B’) 


is commutative. We can deduce two natural transformations from this commutativity. First, fix 
g : B — B' and consider the map, A ~ hom(A,g) € homg,,(hom (4, B) hom (4,B')) The 
commutativity of the foregoing diagram states that A ~ hom/(A,g) is a natural transformation of 
the contravariant functor hom(—.,B) into the contravariant functor hom( — ,B’)o Similarly, the 
commutativity of (21) can be interpreted as saying that for f : A' — A the map B ~ hom (fB) is 
a natural transformation of hom (A, —) into hom (4', —). 

In the applications one is often interested in “representing” a given functor by a hom 
functor. Before giving the precise meaning of representation we shall determine the natural 
transformations from a functor hom,(A, —), which is a functor from C to Set, to any functor F 


from C to Set. Let a be any element of the set FA and let Be ob C, k e hom,(A,B). Then F(k) is 
a map of the set FA into the set FB and its evaluation at a, F(k) (a) € FB. Thus we have a map 


(22) ag:k œ> F(k) (a) 
of hom,(A, B) into FB. We now have the important 


YONEDA’S LEMMA. Let F be a functor from C to Set, A an object of C, a an element of 
the set FA. For any B e ob C let ap be the map of hom,(A,B) into FB such that k ~ f(k)(a). 


Then B ~ ap is a natural transformation n(a) of hom,(A, —) into F Moreover, a ~ n(a) is a 
bijection of the set FA onto the class of natural transformations of hom,(A, —) to F The 
inverse of a ~ n(a) is the map yn ~ n 414) € FA. 


Proof. We have observed that (22) is a map of hom,(A,B) into FB. Now let g:B — C. Then 
F(g)ag(k) = F(g)F(k) (a) = F(gk) (a) 
aç hom (A,g) (k) = a¢(gk) = F(gk) (a). 


Hence we have the commutativity of 


hom (A,B) a FB 


hom (A, — Jg) Fig) 


hom (A.C) Qe Fc 


Then (a) :B — ap is a natural transformation of hom (A, —) into F. Moreover y(a)(14) = 
ag(1 4) =fUla)(aj=a. 

Next let 7 be any natural transformation of hom (4, —) into F. Suppose f e hom,(4,B). The 
commutativity of 


bom (A.A) "a FA 
hom (Af) PW) 
hom (A,B) Ts FB 


implies that n30) = np(f Li) = ng(hom (4,f) (14) = Fna(1) = fÓ) (a) where a = ny(le FA. 
This shows that 7 = 7(a) as defined before. 
The foregoing pair of results proves the lemma. O 


We shall call a functor F from C to Set representable if there exists a natural isomorphism 
of F with a functor hom (4, —) for some Aeob C. If 7 is this natural isomorphism then, by 
Yoneda’s lemma, 7 is determined by A and the element a = 1 ,(/,) of FA. The pair (A, a) is 


called a representative of the representable functor F. 


EXERCISES 


1. Apply Yoneda’s lemma to obtain a bijection of the class of natural transformations of 
hom (A, —) to hom (4’—) with the set home (A’, A). 


2. Show that f:B — B'is monic in C if and only if hom (4,/) is injective for every A € obC. 


3. Dualize Yoneda’s lemma to show that if F is a contravariant functor from C to Set and A 
€ ob C then any natural transformation of hom;(—, A) to F has the form B ~ ag where ap 


is a map of hom, (B, A) into FB determined by an element a € FA as 
dik > F(k)a. 


Show that we obtain in this way a bijection of the set FA with the class of natural 
transformations of hom, (— ,A) to F. 


1.7 UNIVERSALS 


Two of the earliest instances of the concept of universals are those of a free group determined 
by a set X and the universal (associative) enveloping algebra of a Lie algebra. We have 
considered the first for a finite set X in BAI, pp. 68—69, where we constructed for a set X, of 
cardinality r < œ, a group FG” and a map i :x —> X of X into FG” such that if G is any group 
and g is a map of X into G then there exists a unique homomorphism g: FG” — G, making the 
diagram 


commutative. Here 9' is regarded as a map of sets. 

We recall that a Lie algebra L over a field is a vector space equipped with a bilinear 
product [xy] such that [xx] = 0 and [[xy]z] +[[vz]x] + [[zx]y] = 0. If A is an associative algebra, 
A defines a Lie algebra A” in which the composition is the Lie product (or additive 
commutator) [xy] = xy-vx where xy is the given associative product in A (BAI, pp. 431 and 
434). It is clear that if A and B are associative algebras and f is a homomorphism of A into B, 
then fis also a Lie algebra homomorphism of A” into B~. 

If L is a Lie algebra, a universal enveloping algebra of L is a pair (U(L),u) where U(L) is 
an associative algebra and u is a homomorphism of L into the Lie algebra U(L)~ such that if g 
is any homomorphism of L into a Lie algebra A” obtained from an associative algebra A” then 
there exists a unique homomorphism 9' of the associative algebra U(L) into A such that 


L a UL) 


8g 


A- 


is a commutative diagram of Lie algebra homomorphisms. We shall give a construction of 
(U(L),u) in Chapter 3 (p. 142). 

Both of these examples can be formulated in terms of categories and functors. In the first we 
consider the categories Grp and Set and let F be the forgetful functor from Grp to Set that 
maps a group into the underlying set and maps a group homomorphism into the corresponding 
set map. Given a set X, a free group determined by X is a pair (U,u) where U is a group and u 
is a map of X into U such that if G is any group and g is a map of X into G, then there exists a 
unique homomorphism 9' of U into G such that fu = G holds for the set maps. 

For the second example we consider the category Alg of associative algebras and the 
category Lie of Lie algebras over a given field. We have the functor F from Alg to Lie defined 
by FA = A for an associative algebra A, and if f:A — B for associative algebras, then Ff) = 
fA — B for the corresponding Lie algebras. For a given Lie algebra L, a universal envelope 
is a pair (U(L),u) where U(L) is an associative algebra and u is a Lie algebra homomorphism 


of L into U(L) such that if g is any homomorphism of L into a Lie algebra 4’, A associative, 
then there exists a unique homomorphism 9: of U(L) into A such that 9'u = g. 
We now give the following general definition of universals. 


DEFINITION 1.6. Let Cand D be categories, F a functor from C to D. Let B be an object 
in D. A universal from B to the functor F is a pair (U, u) where U is an object of C and u is a 
morphism from B to FU such that if g is any morphism from B to FA, then there exists a 


unique morphism 9 of U into A in C such that 


B ud FU 


FA 


is commutative. U is called a universal C-object for B and u the corresponding universal 
map. 


It is clear that the two examples we considered are special cases of this definition. Here 
are some others. 


EXAMPLES 


1. Field of fractions of a commutative domain. Let Dom denote the subcategory of the 
category Ring whose objects are commutative domains ( = commutative rings without zero 
divisors 40) with monomorphisms as morphisms. Evidently this defines a subcategory of Ring. 
Moreover, Dom has the full subcategory Field whose objects are fields and morphisms are 
monomorphisms. If D is a commutative domain, D has a field of fractions F (see BAI, p. 115). 
The important property of F is that we have the monomorphism u:a ~ a/1 of D into F, and if g 
is any monomorphism of D into a field F’, then there exists a unique monomorphism 9' of F into 
F’ such that g = gu. We can identify D with the set of fractions d/1 and thereby take u to be the 
injection map. When this is done, the result we stated means that any monomorphism of D (c 
F) has a unique extension to a monomorphism of F into F’ (see Theorem 2.9, p. 117 of BAD. 

To put this result into the mold of the definition of universals, we consider the injection 
functor of the subcategory Field into the category Dom (see example 1 on p. 20). If D is a 
commutative domain, hence an object of Dom, we take the universal object for D in Field to be 
the field f of fractions of D and we take the universal map u to be the injection of D into F. 
Then (F, u) is a universal from D to the injection functor as defined in Definition 1.6. 


2. Free modules. Let R be a ring, X a non-vacuous set. We define the free (left) R-module ® 
xR to be the direct sum of X copies of R, that is, ®,R= ®M,, ae X, where every M, = R. Thus 


@,R is the set of maps f of X into R such that f(x) = 0 for all but a finite number of the x’s. 
Addition and the action of R are the usual ones: (f+g)(x) = f(x) + g(x), (rf) (x) = rf(x). We have 


the map u: x~ X of X into @ R where X is defined by 
X(x)=1, xy)=0 if y#x. 


Iffe @, Rand {x,,..., Xp} is a subset of X such that f(y) = 0 for yg {x),... Xn}, then f= r; X; 
where f(x) = r;. Moreover, it is clear that for distinct X ; $r; X = 0 implies every r; = 0. Hence 
the set X = { X ke X} is a base for ®,R in the sense that every element of this module is a sum 
X, vex" ¥* which is finite in the sense that only a finite number of the r, are + 0, and Y’x * = 0 
for such a sum implies that every r, = 0. 

Now suppose M is any (left) R-module and ọ : x ~ m, is a map of X into M. Then 


P yr, X >F, 


is a well-defined map of @ „R into M. It is clear that this is a module homomorphism. 
Moreover, #( X ) = m, so we have the commutativity of 


x u @,R 


M 


Since a module homomorphism is determined by its restriction to a set of generators, it is clear 
that è is the only homomorphism of the free module ®,R into M, making the foregoing diagram 
commutative. 

Now consider the forgetful functor F from R-mod to Set that maps an R-module into its 
underlying set and maps any module homomorphism into the corresponding set map. Let X be a 
non-vacuous set. Then the results mean that (®@,R,u) is a universal from X to the functor F. 


3. Free algebras and polynomial algebras. If K is a commutative ring, we define an 
(associative) algebra over K as a pair consisting of a ring (A, +,.,0, 1) and a K-module 4 such 
that the underlying sets and additions are the same in the ring and in the module (equivalently, 
the ring and module have the same additive group), and 


a(xy) = (ax)y = x(ay) 


for aEK, x,yeA (cf. BAI, p. 407). The algebra A is said to be commutative if its multiplication 
is commutative. Homomorphisms of K-algebras are K-module homomorphisms such that 1 ~ 
1, and if x ~ x’ and y ~ y’ then xy ~ x'y’. We have a category K- alg of K-algebras and a 
category K-comalg of commutative K-algebras. In the first, the objects are K-algebras and the 
morphisms are K-algebra homomorphisms. K-comalg is the full subcategory of commutative 
K-algebras. 

We have the forgetful functors from K-alg and K-comalg to Set. If X is a non-vacuous set, 


then a universal from X to the forgetful functor has the form (K{X},u) where K{X} is a K- 
algebra and u is a map of X into K{X} such that if g is any map of X into a K-algebra A, then 


there is a unique K-algebra homomorphism f: of K{X} into A such that fu = g (as set maps). If 
(K{X},u) exists, then K{X} is called the free K-algebra determined by X. In a similar manner 
we can replace K-alg by K-comalg. A universal from X to the forgetful functor to Set is 
denoted as (KLX],u). If X = {%),%2, ..., Xp}, we let K/X7,...,X,,/ be the polynomial ring in the 
indeterminates X; with coefficients in K. If g is a map of X into a commutative K-algebra A, 
then there is a unique homomorphism of KLX}..., X,] into A as K-algebras such that X; > g(x,), 
1 <i <n (BAI, p. 124). Hence KLX),...,X,,]and the map x; ~ X; constitute a universal from X to 
the forgetful functor from K-comalg to Set. 


4. Coproducts. Let C be a category. We have the diagonal functor 4 of C to C x C that maps 
an object A of C into the object (4, A) of C x C and a morphism f:4—>B into the morphism (f, 
f):(A,A) — (B,B). A universal from (4;,A,) to 4 is a pair (U,u) where U is an object of C and u 
= (uuz), u; A; >U such that if C € obC and g; A— C, then there is a unique ĝ': U — C such 
that g; = fu; i = 1, 2. This is equivalent to saying that (U,u juz) is a coproduct in C of A; and 
A,. This has an immediate generalization to coproducts of indexed sets of objects of C. If the 
index set is J = {a}, then a coproduct u C., is a universal from (4,) to the diagonal functor 
from C to the product category C! where C! is the product of /-copies of C (cf. p. 21). 


One can often construct universals in several different ways. It is immaterial which 
determination of a universal we use, as we see in the following strong uniqueness property. 


PROPOSITION 1.6. If (U,u) and (U' u') are universals from an object B to a functor F then 
there exists a unique isomorphism h:U — U' such that u' = f(h)u. 


We leave the proof to the reader. We remark that this will follow also from exercise 4 
below by showing that (C, u) is the initial element of a certain category. 

As one might expect, the concept of a universal from an object to a functor has a dual. This 
is given in 


DEFINITION 1.7. Let C and D be categories, G a functor from D to C. Let A € ob C. A 
universal from G to A is a pair (Vv) where V € ob D and vehom,(GV,A) such that if B € ob D 


and G ehom,(GB. A), then there exists a unique 9':B — V such that vG(9') = G. 


As an illustration of this definition, we take C = R-mod, D = Set as in example 2 above. If 
Xis a non-vacuous set, we let GX = @,R, the free module determined by X. It is convenient to 


identify the element x € X with the corresponding element X of @ R and we shall do this from 
now on. Then @,R contains X and X is a base for the free module. The basic property of X is 
that any map ¢ of X into a module M has a unique extension to a homomorphism of @,R into M. 


If X = Ø, we define ®@,R = 0. If X and Yare sets and ¢ is a map of X into Y, then ¢ has a unique 
extension to a homomorphism ¢ of Ə R into ®yR. We obtain a functor G from Set to R-mod by 
putting G(X) = @ R and GØ) = ¢. 

Now let M be a (left) R-module, FM the underlying set so GFM = @ppR. Let v be the 
homomorphism of GFM into M, extending the identity map on FM. Let X be a set and g a 
homomorphism of @,R into M, and put 9' = g/X. Then G(g) is the homomorphism of @,R into 
@pyR such that GÀ) (x) = g(x), xeX, and vG(9')(x) = g(x). Hence vG/(9') = g. Moreover, 9' is 


the only map of X into FM satisfying this condition. Hence (FM,v) is a universal from the 
functor G to the module M. 


EXERCISES 


1. Let D be a category, 4 the diagonal functor from D to D — D. Show that (Vv), v = (v1,V9), 
is a universal from 4 to (4,,A,) if and only if (Vv;,v2) is a product of 4; and A, in D. 


2. Let Rng be the category of rings without unit, F the functor from Ring to Rng that forgets 
the unit. Show that any object in Rng possesses a universal from it to F. (See BAI, p. 
155.) 


3. Let F; be a functor from C; to C,, F, a functor from C, to C3. Let (U>,u2) be a universal 
from B to F,, (U;,u, a universal from U; to F. Show that (U;,,F'>(U,) U3) is a universal 
from B to FF}. 


4. Let F be a functor from C to D, B an object of D. Verify that the following data define a 
category D(B,F): The objects are the pairs (4,g) where A € obC and gehomp(B,FA). 
Define hom ((4,g) (A'g')) as the subset of hom,(4, 4’) of h such that g’ = f(h)g and 
arrange to make these hom sets non-overlapping. Define multiplication of morphisms as 
in C and l4 g) = 14. Show that (U, u) is a universal from B to F if and only if (U, u) is an 
initial element of D(B, F). Dualize. 


1.8 ADJOINTS 


We shall now analyze the situation in which we are given a functor F from C to D such that for 
every object B of D there exists a universal (U, u) from B to F. The examples of the previous 
section are all of this sort. First, we need to consider some alternative definitions of 
universals. 

Let F be a functor from C to D, B an object of D, (U,u) a pair such that Ueob C and 
ue homp(B,FU). Then iffehom,(U,A), f(H_ homp(FU,FA4) and Ff)uehomp(B, FA). Accordingly, 
we have the map 


(23) Naif F(f)u 


of the set hom-(U,A) into the set homp(B,F,4). By definition, (U, u) is a universal from B to F 


if for any object A of C and any morphism g:B — FA in D there is one and only one 9':U — A 


such that g = F(9')u. Evidently this means that (U, u) is a universal from B to f if and only if for 
every A €ob C the map 7, from hom-(U,A) to homp(B, FA) given in (23) is bijective. If this is 


the case and h: A — A' in C, then the diagram 


home (U,A) M homp (B,FA) 
home (U,A) homp (8, F{h)) 
home (U,A") Ta’ homp (8,FA') 


is commutative, since one of the paths from an fe hom,(U, A) gives F(hf)u and the other gives 
F(h)Ff)u, which are equal since F(hf) = f(h)F(f). It follows that n:A ~ na is a natural 
isomorphism of the functor hom,(U, —) to the functor homp(B, F —) that is obtained by 
composing F with the functor homp(B, —). Note that both of these functors are from C to Set, 
and since ny(ly) = FU y)u = 1 pyu = u, the result is that homp(B,F —) is representable with (U, 
u) as representative. 

Conversely, suppose homp(B, f —) is representable with (U, u) as representative. Then, by 
Yoneda’s lemma, for any object A in C, the map 7, of (23) is a bijection of hom,(U, A) onto 
homp(B, F4). Consequently (U,u) is a universal from B to F. 


Similar considerations apply to the other kind of universal. Let G be a functor from D to C, 
A an object of C, (V, v) a pair such that Veob D and v € hom,(GV,A). Then for any B € ob D 


we have the map 


(24) pig > vG(g) 


of hom,(B, V) into hom,(GB, 4), and (Vv) is a universal from G to A if and only if Cp is a 
bijection for every BeobD. Moreover, this is the case if and only if ¢:B ~ Cp is a natural 
isomorphism of the contravariant functor homp(—, V) with the contravariant functor hom,(G -, 
A) obtained by composing G with hom,( — A). 

We now assume that for every Be ob D we have a universal from B to the functor F from C 
to D. For each B we choose a universal that we denote as (GB, up). Then for any A € ob C we 
have the bijective map 


(25) NBA f~ F(f)ug 


of home (GB,A) onto homp(B,FA), and for fixed B, A ~ ng 4 is a natural isomorphism of the 
functor home (GB, —) to the functor homp(B,F —). 


Let B' be a second object in D and let A :B — B'. We have the diagram 


# h B’ 
us Ma: 
FGB FGB' 


SO upghehomp (B, FGB'), and since (GB,ug) is a universal from B to F there is a unique 


morphism G(h): GB — GB' such that the foregoing diagram becomes commutative by filling in 
the horizontal F'G(h): 


(26) FG(h)u, = ugh. 


The commutativity of the diagram 


B h B’ k B’ 
FGB FGth) FGB’ FG(k) FGB” 
and the functorial property (FG(k)) (FG(h)) = F(G(k)G(h)) imply that 
B kh B 
FGB FIGIK)GIh)) FGB” 


is commutative. Since G(kh) is the only morphism GB — GB" such that FG(kh)up = upkh, it 
follows that G(kh) = G(k)G(A). In a similar manner we have G(1}) = lg. Thus G is a functor 
from D to C. 


We wish to study the relations between the two functors F and G. Let AE ob C. Then F Ae 
ob D, so applying the definition of a universal from FA to F to the map g = /;,, we obtain a 


unique v ,GFA — A such that 


Then for any B € ob D we have the map 
(28) C4,B:9 > U4G(g) 


of homp(B, FA) into hom,(GB, 4). Now let g e homp(B, F4). Then 
Np.aba.h9) = F(V4G(g))ug = F(V4) (FG(g) ug = Flvg)upag = 9 


(by (26) and (27)). Since ng, is bijective, it follows that ¢4 2 = ng 4 / and this is a bijective 
map of homp(B, F4) onto hom,(GB, A). 

The fact that for every B € ob D, Cy g:g ~ v4G(g) is a bijective map of homp(B, FA) onto 
hom,(GB, A) implies that (F4, V4) is a universal from G to A, and this holds for every AeEobC. 
Moreover, for fixed A, B ~ Cy g is * natural isomorphism of the contravariant functor homp( —, 
FA) to the contravariant functor hom,(G — A). Consequently, B ~ g4 = Cy g is a natural 
isomorphism of hom,(G —, A) to homp( —, FA). 

We summarize the results obtained thus far in 


PROPOSITION 1.7. Let f be a functor from C to D such that for every B € obD) there is a 
universal from B to F. For each B choose one and call it (GB,u,). If h :B — B' in D, define 


G(h) :GB — GB' to be the unique element in hom -(GB,GB’) such that /(G(h))ug = ug-h. Then 
G is a functor from D to C. If Ae obC, there is a unique v,:;GFA—A such that f(vq)Uyq = 1ra- 
Then (FA,v4) is a universal from G to A. If ng 4 is the map f ~ ff)ug of home(GB, A) into 
homp(B, FA), then for fixed B, A ~ ņg4 is a natural isomorphism of hom,(GB, —) to 
homp(B,F —) and for fixed A, B ng 4 is a natural isomorphism of hom,(G — ,A) to homp( —, 
FA). 


The last statement of this proposition can be formulated in terms of the important concept of 
adjoint functors that is due to D. M. Kan. 


DEFINITION 1.8. Let f be a functor from C to D, G a functor from D to C. Then F is 
called a right adjoint of G and G a left adjoint of f if for every (B, A), B € obD, A € obC, we 
have a bijective map np 4 of hom,(GB,A) onto homp(B,FA) that is natural in A and B in the 


sense that for every B, A ~ np 4 is a natural isomorphism of hom,(GB, —) to homp(B,F —) 
and for every A, B np 4 is a natural isomorphism hom,(G —,A) to homp( —, FA). The map q: 
(B, A) ~ Ng 4 is called an adjugant from G to F, and the triple (F, G, n) is an adjunction. 


Evidently the last statement of Proposition 1.7 implies that (F'G,7) is an adjunction. We 
shall now show that, conversely, any adjunction (F;G,7) determines universals so that the 
adjunction can be obtained from the universals as in Proposition 1.7. 

Thus suppose (F, G, 77) is an adjunction. Let B € obD and put “B = np ga(lgg) € homp(B, 
FGB). Keeping B fixed, we have the natural isomorphism A ~ 74, , of hom ¢(GB, —) to 
homp(B, F —). By Yoneda’s lemma, iff € hom¢(GB,A) then 


Ng.a(f) = homp(B, F(/))ug = F(f)ug. 


Then f ~ ff)up is a bijective map of hom,(GB, A) onto homp,(B,FA). The fact that this holds 
for all AeobC implies that (GB, Up) is a universal from B to F. 


Let k e hom,(4, 4’), f e hom (GB, A) g e hom,(GB'A) hehomp)(B, B"). The natural 
isomorphism A ~ 7, 4 of hom,(GB, —) to homp(B,F—) gives the relation 


and the natural isomorphism B ~ Ng 4 of hom,(G —, A) to homp( — FA) gives 
(30) Np,a(gG(h)) = ny, 4(g)h. 


(Draw the diagrams.) These imply that 
F(G(h))u, = F(G(h) ng.¢a(leg) = Ne.cp(G(h)l gg) 
(by (29) with A = GB, A’ = GB', k = G(h), f = Ica) 


= Nace ge Gh) = ny ge (leg h = Ugh 
(by (30) with A = GB’, g = l gp’) 


The relation F(G(h))up = up.h, which is the same as that in (26), shows that the given functor 
G is the one determined by the choice of the universal (GB, up) for every BeobD. We have 
therefore completed the circle back to the situation we considered at the beginning. 

An immediate consequence of the connection between adjoints and universals is the 
following 


PROPOSITION 1.8. Any two left adjoints G and G' of a functor f from C to D are naturally 
isomorphic. 


Proof. Let 7 and 7' be adjugants from G and G’ respectively to F. For Be obD, (GB, Up) and 
(G'B, U'z), where up and u'g are determined as above, are universals from B to F. Hence there 
exists a unique isomorphism Ap .GB — G'B such that u's = F(Ap)up (Proposition 1.5). We shall 
be able to conclude that A :B ~ Ap is a natural isomorphism of G to G' if we can show that for 
any h :B — B' the diagram 


GB An G'B 
Gih) G'th) 
GB’ rE G'B' 


is commutative. To see this we apply 7 gp to G'(h)Ag and AgG(h). This gives 


F(G'(h))F(A,)ug = F(G'(h))u, = uh (by (26)) 
F(A, )F(G(h))uy = F(Ag ugh = ugh (by (26)). 


Since 7g Gg is an isomorphisM, it follows that G'(h)Ap = AgG(h) and so the required 
commutativity holds. 


Everything we have done dualizes to universals from a functor to an object. We leave it to 
the reader to verify this. 


EXERCISES 

1. Determine left adjoints for the functors defined in the examples on pp. 42—44. 

2. Let (E G, y) be an adjunction. Put ug = Ng Gg] gpg) ehomp(B, FGB). Verify that U:B ~ 
Up is a natural transformation of lp to FG. u is called the unit of the adjugant 7. 
Similarly, v-A ~ v4 = Npa, 4 (lra) ° natural transformation of GF to 1,. This is called 
the co-unit of 7. 

3. Let (G, Fp) be an adjunction where F,:C,;—C,, G):C,—-C), and let (G,,F>,n2) be an 
adjunction where F, :C, — C3, Gy :C3 — C,. Show that GG, is a left adjoint of FF; 
and determine the adjugant. 


4. Let (G, Fm) be an adjunction. Show that F:C — D preserves products; that is, if Æ = 
[JA, with maps p,:A — Ax then FA = [ [FA with maps F(p,). Dualize. 
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Universal Algebra 


The idea of “universal” algebra as a comparative study of algebraic structures appears to have 
originated with the philosopher-mathematician Alfred North Whitehead. In his book A Treatise 
on Universal Algebra with Applications, which appeared in 1898, Whitehead singled out the 
following topics as particularly worthy of such a comparative study: “Hamilton’s Quaternions, 
Grassmann’s Calculus of Extensions and Boole’s Symbolic Logic.” Perhaps the time was not 
yet ripe for the type of study that Whitehead undertook. At any rate, the first significant results 
on universal algebra were not obtained until considerably later—in the 1930’s and 1940’s, by 
G. Birkhoff, by Tarski, by Jonsson and Tarski, and others. 

The basic concept we have to deal with is that of an “Q-algebra,” which, roughly speaking, 
is a non-vacuous set equipped with a set of finitary compositions. Associated with this concept 
is an appropriate notion of homomorphism. We adopt the point of view that we have a set Q of 
operator symbols (for example, +, -, ^ that is given a priori, and a class of non-vacuous sets, 
the carriers, in which the operator symbols are realized as compositions in such a way that a 
given symbol œ is realized in every carrier as a composition with a fixed “arity,” that is, is 
always a binary composition, or a ternary composition, etc. The class of Q-algebras for a fixed 
Q constitutes a category with homomorphisms of Q-algebras as the morphisms. Categorical 
ideas will play an important role in our account. 

The concept of Q-algebras can be broadened to encompass relations as well as operations. 
In this way one obtains the basic notion of a relation structure that serves as the vehicle for 
applying mathematical logic to algebra. We shall not consider the more general concept here. 

We shall develop first some general results on homomorphisms and isomorphisms of Q- 
algebras that the reader has already encountered in special cases. Beyond this, we introduce 
the concept of a subdirect product of algebras and the important constructions of direct limits, 
inverse limits, and ultraproducts of Q-algebras. We shall be particularly interested in varieties 
(or equational classes) of Q-algebras, free algebras, and free products in varieties, and we 
shall prove an important theorem due to G. Birkhoff giving an internal characterization of 
varieties. The important special case of free products of groups and of free groups will be 


treated in a more detailed manner. 


2.1 Q-ALGEBRAS 


We review briefly some set theoretic notions that will be useful in the study of general (or 
universal) algebras. 

If A and B are sets, then one defines a correspondence from A to B to be any subset of A x 
B. A map f: A— B is a correspondence such that for every a € A there is a b in B such that (a, 
b) € f, and if (a, b) and (a, b’) € f, then b = b’. The uniquely determined b such that (a, b) € fis 
denoted as f(a). This gives the connection between the notation for correspondences and the 
customary function or map notation. If ® is a correspondence from A to B, the inverse 
correspondence ® from B to A is the set of pairs (b, a) such that (a, b) € ®. If ® is a 
correspondence from A to B and ¥ is a correspondence from B to C, then ¥ ® is defined to be 
the correspondence from 4 to C consisting of the pairs (a, c), a € A, c € C, for which there 
exists a b € B such that (a, b) € ® and (b, c) e Y. If fis a map from A to B and g is a map 
from B to C, then the product of the correspondence gf is the usual composite of g following f: 
(gf)(a) = g(f(a)). As usual, we denote the identity map on the set A by 1,. This is the 
correspondence from A to A consisting of the elements (a, a), a € A. It is also called the 
diagonal on the set A. If ® is a correspondence from A to B, ¥ a correspondence from B to C, 
© a correspondence from C to D, then the following set relations are readily checked: 


(1) (OP)O = O(¥O). 
(2) (YO) =O Yo! 
(3) (071)! =o. 
(4) O1,=0= 1,0. 


The set of maps from A to B is denoted as B4. If A= {1, 2, ..., n} we write B™ for B4 and 
call its elements n-tuples (or sequences ofn elements) in the set B. We identify B with B. We 
define an n-ary relation R on the set A to be a subset R of A”. Thus a binary (n = 2) relation 
on A is a correspondence from A to A. An alternative notation for (a,b) € R is aRb. In this case 
we Say that a is in the relation R to b. 

An equivalence relation E on A is a binary relation that is reflexive: aEa for every ae A; 
symmetric: if aEb then bEa; and transitive: if aEb and bEc then aEc. These conditions can be 
expressed more concisely in the following way: 


1,cE (reflexivity). 
E =E! (symmetry). 
EEcE (transitivity). 


The element of the power set A(A) (set of subsets of A) consisting of the elements b such that 
bEa is denoted as a, or ā if E is clear. The set of these @ constitutes a partition of the set A 


called the quotient set A/E of A with respect to the equivalence relation E (see, for example, 
BAI, p. 11). By a partition of A we mean a set of non-vacuous subsets of A such that A is their 
disjoint union, that is, A is their union and the intersection of distinct subsets is vacuous. There 
is a 1-1 correspondence between partitions of A and equivalence relations on A (BAI, pp. 11- 
12). 

Another important type of relation on a set A is that of a partial order >. This is defined by 
the following properties: a > a (reflexivity); if a > b and b= a, then a = b (anti-symmetry); and 
ifa > b and b > c, then a > c (transitivity). If we write O for the subset of A x A of (a, b) such 
that a > b, then these conditions are respectively: 


O = P 
0n go C l4. 
00 c 0. 
Ifn is a positive integer, we define an n-ary product on A to be a map of A” to A. Thus the 


n-ary products are just the elements of the set A” Ten = 1, 2, 3, etc., we have products that 


are unary, binary, ternary, etc. It is convenient to introduce also nullary products, which are 
simply distinguished elements of A (e.g., the element 0 of a ring). We can extend our notation 


AA” to include A., which is understood to be A, the set of nullary products on 4. Thus we 
have nary products for any n € N= {0, 1,2, 3,...}. Ifn > 1 and o € Aa” SO œ is a map (a), a, 
..+5 Ay)  O(dj,...,a,) € A, then the set of elements of the form 


(isa... Ap Olai., Ap) 


is a subset of A” D and this is an (n+1)-ary relation on A. Since the nullary products are just 
the elements of A, we see that for any n € N the notion of (n + 1)-ary relation includes that of n- 
ary product on A. 

We are now ready to define the concept of a “general” algebra or Q-algebra. Roughly 
speaking, this is just a non-vacuous set A equipped with a set Q of nary products, n = 0, 1,2,.... 
In order to compare different algebras—more precisely, to define homomorphisms—it is 
useful to regard Q as having an existence apart from A and to let the elements of Q determine 
products in different sets A, B,... in such a way that the products determined by a given we Q 
in A,B,... all have the same arity, that is, are n-ary with a fixed n. We therefore begin with a set 


Q together with a given decomposition of Q as a disjoint union of subsets Q(), n = 0, 1,2,.... 
The elements of Q(7) are called n-ary product (or operator) symbols. For the given Q and 


decomposition Q = UAn) we introduce the following 


DEFINITION 2.1. An Q-algebra is a non-vacuous set A together with a map of Q into 
products on A such that if wEQ(n), then the corresponding product is nary on A. The 
underlying set A is called the carrier of the algebra, and we shall usually denote the algebra 
by the same symbol as its carrier. 


If @ is nullary, so we Q(0), the corresponding distinguished element of A, is denoted as œ; 
or, if there is no danger of confusion, as œ (e.g., the element 1 in every group G rather than 16). 
Ifn > 1 and we Q(n), the corresponding product in A is 


(ai, A5,...,4,) > O(A;,43,...,4,), 
aje A. We shall now abbreviate the right-hand side as 


WA ;Qy*** A, 


thus dropping the parentheses as well as the commas. The observation that this can be done 
without creating ambiguities even when more than one operator symbol occurs is due to 
Lukasiewicz. For example, if œ is 5-ary, ọ ternary, w unary, and å nullary, then 


ola, ’ ay, was, A, l4, ls, WW(d6))) 


becomes @aa,Wa7Aa4,a5wd¢ in Lukasiewicz’s notation, and it is easily seen that if we know 
that g is ternary, œ is 5-ary, 2 is nullary, and y is unary, then the displayed element is the only 
meaning that can be assigned to ga);a,wa3hazasyadg. The reader should make a few more 


experiments with this notation. We shall return to this later in our discussion of free Q- 
algebras. We remark also that we do not exclude the possibility that œ + œ' in Q(n), but the 
associated n-ary products in A are identical. 


EXAMPLES 


1. A monoid is an Q-algebra with Q = {p, 1} where p is binary, 1 is nullary, satisfying the 
following identities (or laws): ppabc = papbc (the associative law for p) and pal =a = pla. 


2. A group is an Q-algebra with Q = {p, 1, i} where p is binary, 1 is nullary, and 7 is unary. 
Corresponding to the group axioms we have the following identities: ppabc = papbc, pal =a 
= pla, paia = 1, piaa = 1. Thus ia is the usual a~/. It is necessary to introduce this unary 
operation to insure that the general theory of Q-algebras has a satisfactory specialization to the 
usual group theory. 


3. A ring is an Q-algebra with Q = {s, p, 0, 1, —} where s and p are binary, 0 and 1 are 


nullary, and — is unary. Here s gives the sum, p is the product, 0 and 1 have their usual 
significance, and —a is the negative of a. We leave it to the reader to formulate the identities 
that are required to complete the definition of a ring. For example, one of the distributive laws 
reads pasbc = spabpac. 


4. Groups with operators. This concept is designed to study a group relative to a given set 
of endomorphisms. Examples are the sets of inner automorphisms, all automorphisms, all 
endomorphisms. From the point of view of Q-algebras, we have a set Q = A Ufp, 1, i} where 
A is a set of unary operator symbols distinct from i and p, 1, i are as in the definition of groups. 
Besides the group conditions on p, 1, i we assume that if4 € A, then 


(5) ¿pab = piadb 
or, in the usual notation, 
(5) }(ab) = (Aa) (Ab). 


Thus a ~ Aa is an endomorphism of the group. This type of algebra is called a group with 
operators or a A-group. 

Any module M (left or right) for a ring R can be regarded as an abelian group with R as 
operator set. 


5. Lattices. Here Q consists of two binary product symbols. If L is the carrier, we denote 
the result of applying these to a,b € L in the usual way as a v b and aa b, which are called the 
join and meet of a and b respectively. The lattice axioms are 


avb=bva, anb=baa. 
(av b)vc=av (bve), (an b)ac=aan (bac). 
ava=a, ana=a. 
(av b)Aa=a, (an b)va=a. 


There is another way of defining a lattice—as a certain type of partially ordered set. One 
begins with a set L witha partial order >. If S is a subset, an upper bound u (lower bound I) of 
Sis an element of L such that u > s (l < s) for every seS. A least upper bound or sup (greatest 
lower bound or inf) of S is an upper bound B (lower bound b) such that B < u (b > D for every 
upper bound u (lower bound /) of S. If a sup or inf exists for a set then it is unique. This is 
clear from the definition. One can define a lattice as a partially ordered set in which every pair 
of elements has a sup a v b and an inf aa b. It is easy to see that this definition is equivalent to 
the one given above. The equivalence is established by showing that the sup and inf satisfy the 
relations listed in example 5 and showing that conversely if one has a lattice in the algebraic 
sense, then one obtains a partial order by defining a > b if aa b = b (or, equivalently, a v b = 
a). Then the given a v b and aa b in L are sup and inf in this partially ordered set. The details 
are easily carried out and are given in full in BAI, pp. 459-460. 


A lattice (viewed as a partially ordered set) is called complete if every non-vacuous subset 
of L has a sup and an inf. Then the element 1 = sup{a|aeL} satisfies 1 > a for every ae L and 0 
= inf{a|aeL} satisfies 0 < a for every a. These are called respectively the greatest and least 
elements of the lattice. A very useful result is the following theorem: 


THEOREM 2.1. A partially ordered set L is a complete lattice if and only if L contains a 
greatest element | (= a for every a) and every non-vacuous subset of L has an inf 


The proof is obtained by showing that if S is a non-vacuous subset, the set U of upper 
bounds of S is not vacuous and its inf is a sup for S (BAI, p. 458). 

The preceding examples show that the study of Q-algebras has relevance for the study of 
many important algebraic structures. It should be noted, however, that some algebraic 
structures, e.g., fields, are not Q-algebras, since one of the operations in a field, the inverse 
under multiplication, is not defined everywhere. Also the results on Q-algebras have only a 
limited application to module theory since the passage to the Q-algebra point of view totally 
ignores the ring structure of the operator set. 


EXERCISES 


— 


. Let ® and ®'’ be correspondences from A to B, ¥ and ’ correspondences from B to C. 
Show that if ® c ®’ and ¥ cP’, then VO c P'O’. 


2. Let E and F be equivalence relations ona set A. Show that EF is an equivalence relation if 
and only if EF = FE. 


3. Let H be a subgroup of a group G and define a=,,b if b | ae H. This is an equivalence 
relation. Show that if K is a second subgroup, then =y and =. commute if and only if HK 
= KH. Hence show that if H-6 (H is normal in G), then = y = x is an equivalence 
relation. 


4. List all of the partitions of {1, 2, ...,) for n = 1, 2, 3, and 4. 


5. Let E(n) be the set of equivalence relations on a set of n elements, |E(n)| its cardinality. 
Prove the following recursion formula for |E(1)|: 


|E(n)| = |E(n—1)| + "DIE -D+ "3DE -3+ + ("2 DIEMI 1. 
6. Let A be an Q-algebra. Let 0 be an i-ary operator symbol. Write 
w Pw ?)03 094.440.4566” d4AgAo 


with parentheses and commas. 


2.2 SUBALGEBRAS AND PRODUCTS 


Let A be an arbitrary Q-algebra. A non-vacuous subset B of A is called a subalgebra of A if for 
any we Q(n) and (b4, bo, ..., bn), be B, wb, bz ... b EB. In particular, if n = 0, then B contains 
the distinguished element w, associated with w. For example, if A is a group, a subset B is a 
subgroup if and only if B contains bc, 1, and b~! for every b,ceB. This coincides with the usual 
definition of a subgroup. Observe that this would not have been the case if we had omitted the 
unary product symbol 7 in the Q-algebra definition of a group. If A is a A-group, a subalgebra 
is a subgroup B such that 4b € B for b e B,Xe A. This is called a A-subgroup of A. 

If B is a subalgebra of A, B becomes an Q-algebra by defining the action of every we Q by 
the restriction to B of its action in A. It is clear that if B is a subalgebra of A and C is a 
subalgebra of B, then C is a subalgebra of A. It is clear also that if {B,|ae/} is a set of 
subalgebras, then NB, is a subalgebra or is vacuous. It is convenient to adjoin Ø to the set of 


subalgebras of A. If we partially order the resulting set by inclusion, then it is clear that 
Theorem 2.1 can be invoked to conclude that this set is a complete lattice. Moreover, the proof 
we sketched shows how to obtain the sup of a given set {B,} of subalgebras: take the 


intersection of the set of subalgebras C containing every B,. Generally this is not the union U 
B, of the sets B,, which is the sup of the B, in the partially ordered set P(A), since UB, need 


not be a subalgebra. However, there is an important case in which it is UB, namely, if {B,} is 
directed by inclusion in the sense that for any By, B, in {Ba} there is a B; in this set such that 
B5 > Bz and B; > B,. For example, this is the case if {B,} is a chain (or is totally ordered). If 
{Ba} is directed and {Bg,,...,B, } is any finite subset of {B,}, then there exists a Bse{B,} 
such that B; > Bp. 1<i<n. 


THEOREM 2.2 Jf {B,} is a directed set of subalgebras, then UB, is a subalgebra. 


Proof. Let we Q(n). If n = 0 it is clear that wjEB = UB.. Now letn > 1, (b,,...,0,)EB™. 
Then 5;€B,€{B,} and so every b;€B; for a suitable B; in {B,}. Then wb, ... b EBs C UB.. 
Hence UB, is a subalgebra. O 


It is clear that if UB, is a subalgebra, then this is the sup of the B,. 
Let X be a non-vacuous subset of the Q-algebra A. Let {B,} be the set of subalgebras of A 


containing X and put LX] = NB.. Then [X] is a subalgebra containing X and contained in every 


subalgebra of A containing X. Clearly LX] is uniquely determined by these properties. We call 
[X] the subalgebra of A generated by the set X. We can also define this subalgebra 
constructively as follows: Put X) = X U U where U is the set of distinguished elements w4, 


we Q(0). For k > 0 let X14, =X, O = {ly = @x] «2. Xp Xp Xi E Xy, OE O(N), n = 1}. Evidently 


Xo C X; Cc... and it is clear that Ux, is a subalgebra containing X. Moreover, if B is any 
subalgebra containing X, then induction on k shows that B contains every X,;. Hence B > Ux, 
and [X] = WY, 

Let {A,|ael} be a family of Q-algebras indexed by a set Z, that is, we have a map a ~ A, of 
I into {44}. Moreover, we allow A, = A, for a + p. In particular, we may have all of the 4, = 
A. It is convenient to assume that all of the A, are subsets of the same set A. We recall that the 
product set Į [;4, is the set of maps a:a ~ a, of I into A such that for every a, a, E A,. We now 
define an Q-algebra structure on [| JA, by defining the products component-wise: If we Q(0), 
we define the corresponding element of []4, to be the map a ~ @, where w, is the element 
of A, singled out by œ. If œ € Q(n) for n > 1 and a", ..., a” € JJA, then we define wa) 
...a™ to be the map 


a wa, Pa, Pa,” 


which is evidently an element of | ]4,. If we do this for all œ, we obtain an Q-algebra structure 
on [|4,. IMM, endowed with this structure is called the product of the indexed family of Q- 
algebras {A,\(ael}. 

If = {1, 2,..., r}, we write 4; x A, x ... x A, for [];A,. The elements of this algebra are the 
r-tuples (a1, a,...,a,), a; E A; If œ is nullary, the associated element in A, x ... x A, is (4; 
W4,9-.04), and if n>] and a? = (a, a, ..., a,%), 1 < i < n, then wa) ... a is the 


element of A, x ... x A, whose ith component is wa{a ... a. 


EXERCISES 


1. Let G be a group and adjoin to the set of product symbols defining G the set G whose 


action is g(x) = g ‘xg. Verify that G is a group with operator set G. What are the G- 
subgroups? 


2. Let X be a set of generators for an Q-algebra A (that is, A = [X]). Show that 4 is the union 
of its subalgebras [F], F a finite subset of X. 


3. Show that an Q-algebra A is finitely generated if and only if it has the following property: 
The union of any directed set of proper (4A) subalgebras is proper. 


4. Give examples of the following: (i) an Q-algebra containing two subalgebras having 
vacuous intersection, and (ii) an Q-algebra in which the intersection of any two 
subalgebras is a subalgebra but there exist infinite sets of subalgebras with vacuous 
intersection. 


2.3 HOMOMORPHISMS AND CONGRUENCES 


DEFINITION 2.2. IfA and B are Q-algebras, a homomorphism from A to B is a map f of A 
into B such that for any we Q(n), n= 0, 1, 2, ..., and every (a, ..., d,) € A” we have 


(6) floa a) = of (a,)f (az) f(a,). 


In the case n = 0 it is understood that if w, is the element of A corresponding to œ then 


f(w 4) = Op. 


It is clear that the composite gf of the homomorphisms f: 4 — B and g: B > Cis a 
homomorphism from A to C and that 1, is a homomorphism from A to A. It follows that we 


obtain a category, Q-alg, whose objects are the Q-algebras and morphisms are the 
homomorphisms. It is interesting to see how the important category ideas apply to Q-alg and 
its important subcategories. We observe now that the product construction of Q-algebras 
provides a product in the sense of categories for Q-alg. Let {A,|aeJ/} be an indexed family of 


algebras and [[/A, the product of these algebras as defined in section 2.2. For each a we have 
the projection map p,:a a, of P=|JA, into A,. It is clear from the definition of the action of 
the œ’s in A that p, is a homomorphism of P into 4,. We claim that {P, pa} is a product in Q- 
alg of the set {4,|aeJ}. To verify this, let B be an Q-algebra, f,:B — A, a homomorphism of B 
into 4, for every a e€ J. It is immediate that if we map any b e B into the element of P whose 
“a-component” is f (b), we obtain a homomorphism f of B into P. Moreover, paf = fẹ and f is 
the only homomorphism of B into P satisfying this condition for all a e J. Hence {P p,} is 
indeed a product of the 4, in Q-alg. 


DEFINITION 2.3. A congruence on an Q-algebra A is an equivalence relation on A, which 
is a subalgebra of A x A. 


If © is an equivalence relation on the Q-algebra A and we Q(0), then (@4, @,) € ® and this 
is the element œw; x 4 corresponding to the nullary symbol œ in the algebra A x A. Hence an 
equivalence relation ® is a congruence if and only if for every we Q(n), n = 1, and (a; a;')E®, 
i=1,2,...,n, we have w(ay, a')(ao, a>’) ... (ap ay’) E D. Since œ(a;, a1) (az, ay’) ... (dps Ap’) 
= (@a)a>...a,, @ay'ay' ... a,'), the requirement is that a,Oa;' 1 < i < n, implies wa, ... a,®wa,' 

.. a,'. Thus the condition that an equivalence relation be a congruence is that this last property 


holds for every n-ary operator symbol with n > 1. Let A/® be the quotient set of A determined 
by the congruence relation ® and let a be the element of A/® corresponding to the element a of 
A, Since a,®a';,, 1 <i <n, implies wa, ... a Powa’ ... a,', we see that we have a well-defined 


map of (4/®)) into A/® such that 


(;,...,4,) > Ma; °*"d,. 


n 


We can use this to define 


(7) wa," a, = Oa, a, 


n 


and we do this for every we Q(n), n > 1. Also for we Q(0) we define 
(3) Oje =O, 


In this way we endow 4/® with an Q-algebra structure. We call A/® with this structure the 
quotient algebra of A relative to the congruence ©. In terms of the natural map v-a~a of A 
into A/®, equations (7) and (8) read wva, ... va, = V@a)...d,, O4/p = Voy. Evidently this 
means that the natural map v is a homomorphism of A into 4/®. 

Let f be a map of a set A into a set B. Then ® = ff is a relation on A. Since f is the subset 
of A x B consisting of the pairs (a,f(a)), a€A, and f! is the subset of B x A of pairs (f(a), a), f 
fis the set of pairs (a, a’) in A x A such that f(a) = f(a’). Thus a@a’ if and only if f(a) = f(a’) 
and it is clear from this that ® is an equivalence relation on the set 4 (BAI, p. 17). We shall 
now call this equivalence relation the kernel of the map f. Since a®a' implies f(a) = f(a’), it is 
clear that f (a) = f(a) defines a map f of A/® into B such that we have commutativity of 


A f B 


Al® 


Here v = vq is the map a ~ @ = @g. Moreover, J is uniquely determined by this property and 
this map is injective. Clearly v is surjective. We now extend this to Q-algebras in the so-called 


FUNDAMENTAL THEOREM OF HOMOMORPHISMS OF Q- ALGEBRAS. Let A and 


B be Q-algebras, f a homomorphism of A into B. Then ® =f 'f is a congruence on A and the 
image f(A) is a subalgebra of B. Moreover, we have a unique homomorphism f of A/® into 
B such that f= Í v where v is the homomorphism a ~ ā of A into A/®. The homomorphism f 
is injective and v is surjective. 


Proof. Let we Q(n), n = 1, (a; a;)E® for 1 <i <n. Then fa) = fa’) and Koa ... a,) = 
ofa) ... Kap = ofa’) ... fla’) = fl@ay' ... a',). Hence (wa, ... ap, way’... a,')E®, So 
(a), a'i) ... (dap a,')e®. Since ® is an equivalence relation, this shows that ® is a 
congruence on the algebra A. It is immediate that f(A) is a subalgebra of B. Now consider the 


map / . If we Q(0), then f (wy) = T © 4) = Koa) = wp and if we Q(n), n= 1, and a; € A, 1 < 
i < n, then 


f(oā, ...4,) = f (Oa, ..-4,) 
= f (wa; .. . ap) 
= wf (a,)...f (ap) 
= wf (ä,) .. f (&,). 


Hence / is a homomorphism. The remaining assertions are clear from the results on maps that 
we noted before. O 

We shall consider next the extension to Q-algebras of the circle of ideas centering around 
the correspondence between the subgroups of a group and those of a homomorphic image 
(BAI, pp. 64-67). Let f be a surjective homomorphism of an Q-algebra A onto an Q-algebra B. 
If A, is a subalgebra of A, the restriction flA; is a homomorphism of 4; hence f4) is a 


subalgebra of B. Next let B, be a subalgebra of B and let A, = f'(B,), the inverse image of B}. 
Then w,€A, for every nullary œ, and if ajeéA, and we Q(n), n = 1, then f(wa, ... a,) = wf(a;) 
f(A) €B s0 way...a,€A,. Hence A, =f |(B)) is a subalgebra of A. Moreover, f(4,) = B; and 
A, is a saturated subalgebra of A in the sense that if a,¢ 4, then every a,’ such that f(a,') = 
f(a) is contained in A). It is clear also that if A, is any saturated subalgebra of A, then A, = f 
~I(f(A,)). It now follows that the map 


A, >f (Aj) 
of the set of saturated subalgebras of A into the set of subalgebras of B is a bijection with 
inverse B; ~f /(B,). 


This applies in particular if we have a congruence ® on A and we take B = A/O, f = v, the 
natural homomorphism a ~ @ = @q of A onto A/®. In this case, if A, is any subalgebra of A, v 


~1(,(A,)) is the union of the congruence classes of A (determined by ®) that meet A,, that is, 
have an element in common with A,. The map v; = v|A, is a homomorphism of A, into A/®). 


Now one sees that the image is A’,/®,' where A,’ = v |(v(4,)) and ®,' is the congruence ® 
M(A,' x 4") on 4'. On the other hand, the kernel of v; is ®© N (A, x A,). We may therefore 
apply the fundamental theorem of homomorphisms to conclude that 


(10) Gio, > A, 
for a, € A, is an isomorphism of A,/®, onto A’;/@,'. We state this result as the 


FIRST ISOMORPHISM THEOREM. Let ® be a congruence on an Q-algebra A, A, a 


subalgebra of A. Let A,' be the union of the ® equivalence classes that meet A,. Then A,' is a 
subalgebra of A containing A,, B,'= D N (A,' x A,') and ®, = ® N (A, x 4) are congruences 
on A,' and A, respectively, and (10) is an isomorphism of A,/®, onto A,'/®,'. 


We shall consider next the congruences on the quotient algebra 4/®, ® a given congruence 
on A. We note first that if O is a second congruence, then we have the correspondence 


(11) Vajo = { (ão, ão)laE A} 


from A/® to A/@. This is a map of 4/® to A/O if and only if ab implies a®b, which means if 
and only if ® c ©. In this case ve/@): 4p ~ 4@ is the unique map of A/® into A/O such that 


A Vo AlQ 
Vo 
(12) Vajo 
Al® 


is commutative. Direct verification, which we leave to the reader, shows that ve is an 
algebra homomorphism. It is clear that vg is surjective and it is injective if and only if © = 
@. 

We now denote the kernel of the homomorphism veg of A/® into 4/O by ©/®. This is a 
congruence on 4/® consisting of the pairs (@g, bg) such that ão = bọ. It follows from this that 
if ©, and O, are two congruences on A such that ©; > ®, then ©, > O,, if and only if O,/® > 
Q,/®. In particular, ©,/® = ©,/@ implies ©, = ©). 

We shall show next that any congruence © on 4/® has the form ©/® where © is a 


congruence on A containing ®. Let ¥ be the canonical homomorphism of A/® onto (4/®0)/O. 
Then ¥vg is a surjective homomorphism of A onto (4/@)/O. If © is the kernel, by the 


fundamental theorem of homomorphisms, we have a unique isomorphism v’ : 4/O — (4/®)/@ 
such that 


A Wg (AlO)/0 


(13) Va ' 


AlO 


is commutative. Then vg = (v’) !¥vg. This implies that © > ® and that we have the 
homomorphism ve@/g as in the commutative diagram (12). Since v@ is surjective on the algebra 
A/® and ve = (v) !¥ vq = VejpV@, We have the equality of the two homomorphisms (v) tř? and 
Vem Of the algebra A/®. Since v' is an isomorphism, the kernel of (v’) tř is the same as that of 
¥. Hence the homomorphisms ¥ and vo; of 4/® have the same kernel. For ¥ this is © and for 
vo it is O/®. Hence 9 = O/O as we claimed. 

The results we derived can be stated in the following way: 


THEOREM 2.3. Let A be an Q-algebra, ® a congruence on A, A/® the corresponding 
quotient algebra. Let © be a congruence on A such that © > ®. Then there exists a unique 
homomorphism Veg: A/® — A/O such that (12) is commutative, and if ©/® denotes the 


kernel of vg; then @/® is a congruence on A/®. The map © ~ O/® is a bijective map of the 
set of congruences on A containing ® onto the set of congruences on A/®. Moreover, ©, > 
©, for two congruences on A containing ® if and only if ©,/® > O,/®. 


We also have the 


SECOND ISOMORPHISM THEOREM. Let © and ® be congruences on the Q-algebra 
A such that © > ® and let ©/® be the corresponding congruence of A/® given in Theorem 
2.3. Then 


(14) (doop ~ ão, aéA, 
is an isomorphism of (A/®)/(©/®) onto A/O. 


Proof. The homomorphism vo; of A/P onto A/O maps 4g ~ āe and has kernel ©/®. 
Accordingly, by the fundamental theorem, (14) is an isomorphism of (4/®)/(©/®) onto 4/0. O 


The results on homomorphisms of Q-algebras specialize to familiar ones in the case of the 
category Grp (see BAI, pp. 61—66). If fis a homomorphism of a group G into a group H, then 
K =f! (1) is a normal subgroup of G. The kernel ® = f!f is the set of pairs (a, b), a, b € G, 


such that a /beK. It follows that äp = aK = Ka and G/® is the usual factor group G/K. The 


fundamental theorem on homomorphisms, as we stated it, is equivalent in the case of groups to 
the standard theorem having this name in group theory. The other results we obtained also have 
familiar specializations (see exercise 1, below). Similarly, if R and S are rings and f is a 
homomorphism of R into S, then K = f !(0) is an ideal in R and R/® = R/K. Our results 
specialize to the classical ones for rings (see exercise 2, below). 


EXERCISES 


1. Let G and H be groups, f a homomorphism of G into H, and K = f }(1). Then in the 
standard terminology, K is called the kernel of f. The standard formulation of the 
fundamental theorem of homomorphism for groups states that f: aK ~ f(a) is a 
monomorphism of G/K into H and we have the factorization f = fv where v is the 
epimorphism a ~ aK. The first isomorphism theorem for groups states that if H is a 
subgroup of a group G and K is a normal subgroup of G, then the subgroup generated by 
H and K is HK = {hk|he H, ke K}, K is normal in HK, K N H is normal in H, and we 
have the isomorphism 


(15) hK ~h(H a K) 


of HK/K onto HH N K). The second isomorphism theorem states that if G>H , K and H 
> K, then H/K is normal in G/K and 


(16) gH > (gK) (H/K) 


is an isomorphism of G/H onto (G/K)/(H/K). Derive the fundamental theorem of 
homomorphisms and the isomorphisms (15) and (16) from the results on congruences on 
Q-algebras. 


State the corresponding results for A-groups (groups with operators). Do the same for 
modules for a ring R. 


2. Derive the results on ring homomorphisms analogous to the preceding ones on groups 
from the theorems on congruences on Q-algebras. 


3. Let A be an Q-algebra, X a set of generators for A (that is, the subalgebra LX] = A). Show 
that if f and g are homomorphisms of A into a second Q-algebra B such that fX = glx, 
then f= g. 


2.4 THE LATTICE OF CONGRUENCES. SUBDIRECT PRODUCTS 


In this section we shall consider first some additional properties of the set of congruences on 


an Q-algebra A. Then we shall apply these to study the important notion of subdirect products 
of algebras. 

We investigate first the set T (4) of congruences on A as a partially ordered set in which the 
ordering > is the usual inclusion ® > ©. This, of course, means that a@@b = a@®b or, 
equivalently, ão = bo > dg =b o. 


THEOREM 2.4 If {®,} is a set of congruences on the Q-algebra A, then No, is a 


congruence. Moreover, if {®,} is directed, then Uo, is a congruence. 


Proof. We note first that if {®,} is a set of equivalence relations on A then No, is an 
equivalence relation, and if {®,} is directed then Uo, is an equivalence relation: Since every 
Pa > lj, No, > 14. If (x, y)e P, then (y, x)e Pı; hence, if (x, y) € N P, then (y, xjeND,. 
Now let (x, y) and (y, z)e ®,; then (x, z)e Pa. Thus if (x, y) and (y, ze No, so does (x, z). 
Hence Ne, is an equivalence relation. Next assume {®,} is directed. It is clear that U, D lj 
and (Uo! = Uo.. Now let (x, y) and (y, z)€ U. Since {@,} is directed, there exists a Ds € 
{Ð} such that (x, y) and (y, z)e ®;. Then (x, z)e@s so (x, z)e Uo,. Hence Uo, is an 
equivalence relation. 


Now let the ©, be congruences so these are subalgebras of A x A as well as equivalence 
relations. Then Ne, is an equivalence relation and so this is not vacuous. Hence No, is alsoa 


subalgebra and thus a congruence. Similarly if {®,} is directed then Uo, is a subalgebra. O 


Clearly No, is an inf of the set of congruences {@,}. It is evident also that the subset A x A 


of A x A is a congruence and this is the greatest element in the partially ordered set of 
congruences. The corresponding quotient algebra AA x A) is the trivial algebra, that is, a 
one-element algebra (in which the action of every wE Q is uniquely determined). It is clear 
also that the diagonal 1, is a congruence, and 4/1, can be identified with A since the 


homomorphism v; is an isomorphism. The fact that the set (A) of congruences on A has a 


greatest element and has the property that infs exist for arbitrary non-vacuous subsets of I(A) 
implies, via Theorem 2.1 (p. 57), that (A) is a complete lattice. In particular, any two 
congruences ® and © have a sup ® v © in IA). We shall indicate an alternative, more 
constructive proof of this fact at the end of this section, in exercise 3. 


Given a binary relation R on the algebra A we can define the congruence [R] generated by 
R to be the inf (or intersection) of all the congruences on A containing R. This is characterized 
by the usual properties: (1) [R] is a congruence containing R and (2) [R] is contained in every 
congruence containing R. 

We recall that a maximal element of a subset S of a partially ordered set A is an element 
meS such that there exists no s in S such that s 4 m and s > m. A basic existence theorem for 


maximal congruences is 


THEOREM 2.5. Let a and b be distinct elements of an Q-algebra A and let D(a, b) be the 
set of congruences ® on A such that ® #(a, b). Then D(a, b) is not vacuous and contains a 
maximal element. 


Proof. Evidently 1,€D¢a, b) so D(a, b) + Ø. Now let {®,} be a totally ordered subset of 
D(a, b), that is, for any Pg, P E {®,} either Pp > P, or P, > Pp. Then {®,} is directed. 
Hence Uo, is a congruence. Evidently Uo, D(a, b) and Uo, > P, Thus every totally 
ordered subset of D(a, b) has an upper bound in D(a, b). We can therefore apply Zorn’s lemma 
to conclude that D(a, b) contains maximal elements. OD 


The following is a useful extension of Theorem 2.5. 


COROLLARY 1. Let © be a congruence on A and a and b elements such that āo # bg. 
Then the set of congruences ® such that ® > © and 4g £ b ọ contains a maximal element. 


Proof. We apply Theorem 2.5 to the algebra A/O and the pair of elements (āo, bọ) of this 


algebra. We have a congruence in this algebra that is maximal among the congruences not 
containing (ão, bg). By Theorem 2.3, such a congruence has the form ®/@ where ® is a 


congruence in A containing ©. Since ®/O¥(ag, bo), vooo # Vom" o, Which means that ag # 
b p. Moreover, ® is maximal among the congruences containing © having this property. D 


There are many important special cases of this result. One of these, which we now 
consider, concerns the existence of maximal ideals of a ring. An ideal B (left, right ideal) of a 
ring R is called maximal if B # R and there exists no ideal (left, right ideal) B’ such that R # B' 
? B. Observe that an ideal (left, right ideal) B is proper (B # R) if and only if 1 ¢ B. We now 
have the following result. 


COROLLARY 2. Any proper ideal (one-sided ideal) in a ring R + 0 is contained in a 
maximal ideal (one-sided ideal). 


Proof. To obtain the result on ideals we use the fact that any congruence in a ring R is 
determined by an ideal B as the set of pairs (a, a’), a, a’ E€ R, such that a — a'e B. Conversely, 
any ideal is determined by a congruence in this way, and if ®, is the congruence associated 
with the ideal B, then Pg > Dp, for the ideal B’ if and only if B > B’. Now if B is proper, then 
Pp does not contain (1, 0). Then Corollary 1 gives a congruence ®j,, M an ideal, such that ®j, 
> Op, Dy € (1, 0), and Ọ y is maximal among such congruences. Then M > B, M # 1, so M is 
proper, and M is maximal in the set of ideals having this property. It follows that M is a 
maximal ideal in the sense defined above. The proof for one-sided ideals is obtained in the 
same way by regarding R as an Q-algebra that is a group (the additive group) with operators, 


the set R of operators acting by left multiplications for the case of left ideals and by right 
multiplications for right ideals. O 


Of course, these results can also be obtained by applying Zorn’s lemma directly to the sets 
of ideals (see exercise 1 below). However, we thought it would be instructive to deduce the 
results on ideals from the general theorem on congruences. 


We consider again an arbitrary Q-algebra A and let {O,|ae/} be an indexed set of 
congruences on A such that Ne, = 1,. For each ael we have the quotient 4, = A/®, and we 
can form the product algebra P = J [ae 74a: We have the homomorphism v,:a ~4@@ of A onto A, 
and the homomorphism v: a ~a where a, =4@ of A into P. The kernel of v is the set of pairs 
(a, b) such that ão = bọ for all a a e J. This is the intersection of the kernels ©, of the v. 


Since Ne, = 1), we see that the kernel of v is 1, and so v is a monomorphism. It is clear also 
from the definition that for any a the image p,,v(A) of v(A) under the projection p, of P onto A, 
is A). 

If {A,|ae/} is an arbitrary indexed set of algebras and P = J JA, then we call an algebra A 
a subdirect product of the A, if there exists a monomorphism i of A into P such that for every 
a, i, = Pai is surjective on A,. Then we have shown that if an algebra A has a set of 


congruences {@,|ae/} such that Ne, = 1,, then A is a subdirect product of the algebras A, = 
A/O, Conversely, let A be a subdirect product of the A, via the monomorphism i. Then if ©, is 


the kernel of i, = p,i, it is immediate that Ne, = 1, and the image i,(A) = 4/O,. 


DEFINITION 2.4. An Q-algebra A is subdirectly irreducible if the intersection of all of the 
congruences of A# 1, is #14. 


In view of the results we have noted we see that this means that if A is subdirectly 
irreducible and A is a subdirect product of algebras A,, then one of the homomorphisms i, = 


Pai is an isomorphism. If a # b in an algebra A, then we have proved in Theorem 2.5 the 
existence of a congruence ®, p that is maximal in the set of congruences not containing (a, b). 
We claim that A/®, p is subdirectly irreducible. By Theorem 2.3, any congruence in A/®, p 
has the form ©,/®, p where ©, is a congruence in A containing ®, p. If this is not the diagonal 
then ©, # &), p and so by the maximality of ®, p, (a, b) € ©,. Then (do, » bo, JEO/Pa p 
for every congruence 0,/®, p on 4/®, p different from the diagonal. Evidently this means that 
A/®, p is subdirectly irreducible. We use this to prove 


THEOREM 2.6 (Birkhoff). Every Q-algebra A is a subdirect product of subdirectly 
irreducible algebras. 


Proof. We may assume that A contains more than one element. For every (a, b), a + b, let ®, p 


be a maximal congruence such that (a, b) ¢ ®, p. Then A/®, p is subdirectly irreducible. Also 


No, p = 14. Otherwise, let (c, d)e Ne, p». € 4 d. Then (c, d)e ®, q, a contradiction. Hence () 
®, ¿= 14 and consequently A is a subdirect product of the subdirectly irreducible algebras A/ 
P, b: a 


EXERCISES 


— 


. Give a direct proof of Corollary 2 to Theorem 2.5. 
2. Show that Z is a subdirect product of the fields Z2/(p), p prime. 


3. Let © and ® be congruences on A. Show that @® c (O@)? c (OP)? c ... and U,(@w)y 
is a congruence that is a sup of © and ®. It follows from this that any finite set of 
congruences has a sup. Use this and the second statement of Theorem 2.4 to show that 
any set of congruences has a sup. 


4. Let {©,} be an arbitrary set of congruences in an algebra A and let © = No.. Show that 
A/® is a subdirect product of the algebras 4/O,. 


2.5 DIRECT AND INVERSE LIMITS 


There are two constructions of Q-algebras that we shall now consider: direct limits and 
inverse limits. The definitions can be given for arbitrary categories, and it is of interest to 
place them in this setting, since there are other important instances of these concepts besides 
the one of Q-algebras that is our present concern. We begin with the definition of direct limit in 
a category C. 

Let J be a pre-ordered set (the index set) and let I be the category defined by Z as in 
example 12, p. 13: ob I=J/ and for a, p € J, hom (a, p) is vacuous unless a < p, in which case 
hom (a, p) has a single element. If C is a category, a functor F from I to C consists of a map a 

» A, of J into ob C, and a map (a, P) ~ Pap E hom,(A,, Ag) defined for all pairs (a, £) witha < 
p such that 


1. Pay = Pp Pop ifasBSy. 


2: Oaa — lax 


Given these data we define a direct (or inductive) limit lim (A,, Pag) as a set (A, {Na} ) where 
Ae obC, Na:4a 4, a € I, satisfies Na = NgPqpg and (4, {Na} ) is universal for this property in the 
sense that if (B, {¢,}) is another such set, then there exists a unique morphism 6: A — B such 


that (Ca = Na, a € I. (This can be interpreted as a universal from an object to a functor for 


suitably defined categories. See exercise | at the end of this section.) It is clear that if a direct 
limit exists, then it is unique up to isomorphism in the sense that if (4’, {7',}) is a second one, 


then there exists a unique isomorphism 8: A — A’ such that 7’, = On, ae. 


A simple example of direct limits can be obtained by considering the finitely generated 
subalgebras of an Q-algebra. Let A be any Q-algebra and let J be the set of finite subsets of A 
ordered by inclusion. If F € J, let Ap be the subalgebra of A generated by F and if F c G, let 


Ørg denote the injection homomorphism of A; into Ag D Ap. Evidently ppr = 1p and pGHPFG = 
Ory lf F c G c H. Moreover, if yp denotes the injection homomorphism of A; into A, then 
ngora = Np. Now let B be an Q-algebra and {¢,|Fe/} a set of homomorphisms such that ¢p: 
A; — B, satisfies CcQrco = G for F CG. Since Ogg is the injection of Ap into Ag, this means 
that the homomorphism €, on Ag is an extension of ¢ on Ap. Since UA, = A, there is a unique 
homomorphism 0 of A into B such that ĉr = 6, p el. It follows that A and {yp} constitute lim 
(Ak org). The result we have proved can be stated in a slightly imprecise form in the following 
way: 


THEOREM 2.7. Any Q-algebra is isomorphic to a direct limit of finitely generated 
algebras. 


We shall now show that if the index set / is directed in the sense that for any a, p e J there 
exists a ye/ such that y > a and y > p, then the direct limit exists in the category C = Q-alg for 
any functor F from I to C. 


Suppose the notations are as above where the 4, are Q-algebras and p,p for a < f is an 
algebra homomorphism of 4, into A,. To construct a direct limit we consider the disjoint union 
U4, of the sets A,. We introduce a relation ~ in U4, by defining a ~ b for ac A,, be Ag if there 
exists a p > a, P such that ga, (a) = 9,(b). (We allow a = £). This relation is evidently 
reflexive and symmetric. Also if 9,,,(a) = 9g,(5) and Øg (b) = 9,,(c) for p = a, p and o = P, y, 
then there exists a t > p, o. Then 


ula) = Pa Papl) a PpPBolb) = Pp(b) 
oP Pa: Ppa(b) = PorPyq(C) = (,,(C), 


which shows that a ~ b and b ~ c imply a ~ c. Hence our relation is an equivalence. 

Let A be the corresponding quotient set of equivalence classes a, aeU4,. If @ e Q(0), then 
O4, Z O4, since there exists a p > a, J and ,,(w,4 ) = O4, = pp (@4,)- We define œ; = wy, . 
Now letn > 1 and let 1 < i < n. Let a; € A,. Choose p > a; 1 <i < n, which can be done since 


Tis directed. Now define 


(17) (a, En ln = Px p41) Parp(42) iii Panp (An): 


This is clearly independent of the choice of the a; n 4@;. We also have to show that this is 
independent of the choice of p. Hence let o> a;, 1 <i <n, and choose t = p, o. Then 


Por, p (41) oink Panp (a,) = OP ye Pa, (41) vie O pP anp (an) 
m2 OPa, (41). «Pant (Gy) 


and similarly, 9,,0Pq,(41)-+-Po,o(4n) = OPa (41). - -Pa zn) and so (17) is independent of the 
choice of p > a; 1 < i < n. This definition of the action of every we Q(n), n = 1, together with 


the previous one of the nullary operations in A gives an Q-algebra structure on A. 
Now fix a and consider the map ,/a, ~ 4, of A, into A. If we Q(0), then 4,04, = @4 = 


w,andifn>1anda,"”,..., a, € A, then taking p = a in (17) gives 


(1) 


wa,")...d, (1) 


(n) (n) 


= 04.a 


a 


Hence 7, is a homomorphism. We have for a < p, gag) = aa SINCE Pag) = Pp—pPaplAa)> 
which shows that 7404(4g) = Nalda) Thus 179g = Na ifa 2 B. 

Next suppose we are given an Q-algebra B and homomorphisms ¢,:4, — B satisfying C,P a, 
= 9, for a <f in I. Suppose 4, = 4, so we have a p > a, p such that 9,,(a,) = pg,(ag). Then 


Cala) = CoP apla) = CoO plap) = Cp(ap). 


Hence we have the map 0 : 4, ~C,(a,) with domain A = U alda) and codomain B. This 
satisfies C, = 97, since 67,(a,) = 9(4,) = Calada) for all aE 4y a € T. Since A is the union of 
the 74(44)» it is clear that 8 is determined by this condition. Also since g,ga, ~ a, if P = a, any 
finite set of equivalence classes relative to ~ has representatives in the same 4,. It follows 
from this that 0 is an Q-algebra homomorphism. Thus we have the universality of (4, {7,}), 
which we require in the definition of direct limit. We can therefore state the following 


THEOREM 2.8. Direct limits exist in the category of Q-algebras for every directed set of 
indices I. 


The notion of inverse limit is dual to that of direct limit. Again we begin with a pre-ordered 
set /= {a} and a category C, but now we let F be a contravariant functor from the category I 
defined by J to C. Thus we have a map a ~ A,€0b C, a € J, and for every a, 6 witha < p we 


have a morphism Øg, : Ag —> A, such that 
T. Pya = PBaP yp ifa <p < Y. 


Z. Daa = 14 


Then an inverse (or projective) limit is a set (A, {7,,}), where A € obC, 7, : A — A, such that 
Phap = Na ifa < P, and if (B, {¢,}) is a set such that B € ob C, ¢ , : BA, and Ppap = Ca for a 
< p, then thhere exists a unique morphism 0 : B — A such that 7,0 = ¢,, a € I. We denote (A, 
{Nq}) by lim (4,, Pag). We have the usual uniqueness up to isomorphism: If (44 {7',}) is a 
second inverse limit, then there exists a unique isomorphism 8 : A' — A such that 7,0 = N'y 

Now suppose C = Q2-alg so the 4, are Q-algebras and for a < P, gp, is a homomorphism of 
Ag into A,. To attempt to construct an inverse limit algebra, we begin with the product algebra 
[]A, and the projections pọ: | 4, — A, defined by p,(a) = a, where aisa a,E4 We now 
define 


(18) A = {a€[]A,|p,(4) = PpaPgl(a),% < P}, 


a, P € I. It may happen, as will be seen in exercise 2 below, that A is vacuous. In this case, no 
solution of our problem exists. For, if B is any algebra and ¢,:B—A, is a homomorphism for a 


€ I such that €, = Øga%p» then for any be B, any element ae ] JA, such that a, = ¢,(b) satisfies the 
condition in (18). 

Now suppose A defined by (18) is not vacuous. Then we claim that A is a subalgebra of 
JM, If oe Q(0), then Pay, ) = © 4, SO if a < p, then 9p(@A,) = œ A, gives Paloa) = 
P poP p@TTA,,)- Hence @yy € A. (Observe that this shows that A + o if Q contains nullary 


operator symbols.) Next let we Q(n), n > 1, and let a,..., a™ € A Then 


ploa- a™) = wa, ...a, = œp (a)... p (a™) 
! 
= OP paP gl j oie P paD g(a”) 


= Ppap(wa”...a). 


Hence wa)... a” e A and A is a subalgebra. If A is not vacuous, we claim that this 
subalgebra of Į JA, together with the homomorphisms 7, = p,|4 is an inverse limit lim (4, Pap) 


in Q-alg. By definition of A we have 71, = Øgang if a < p. Now suppose we have an Q-algebra 
B and homomorphisms ĉa: B — Aa a € J, satisfying C, = Ppop if a < p. We have a unique 
homomorphism unique homomorphisms 0’: B — ] JA, such that b ~ a where a, = ¢,(b). Then, 
as we saw before, acA, so 6’ defines a homomorphism 0 of B into A such that 0b) = 0'@®)). 
Now 7,0(b) = ¢,(6) and @ is uniquely determined by this property. This completes the 
verification that (4, {7,}) = lim (A, Pap). 

An important special case of inverse limits is obtained when we are given a set of 
congruences {®,} on an algebra B. We pre-order the set / = {a} by agreeing that a < p if @, 


> ®;. Put A, = B/P, and for a < p let Øg, be the homomorphism bo, + bba, of Ag into Ag 
Then Paa = 145 PpaPyp = Pya ifas P <y. Let va denote the homomorphism b b p, of B onto 
Aa Then ØpaVg = Va if a < P. This implies that lim (44, Pag) = (4, {Ma3 ) exists. Moreover, we 
have the homomorphism 6:B — A such that 7,0 = v,. Now suppose No, =1,.Lethce B 
satisfy 0b = Gc. Then v,b = v,c or bg = ĉo, for all a. Since Mo, = 1, we obtain b = c. Thus 
in this case the homomorphism 6 of B into A is a monomorphism. 


EXAMPLE 


Ring of p-adic integers. As an example of the last construction we shall define the important 
ring of p-adic integers for any prime p as an inverse limit. It should be remarked that this 
construction is essentially the original one due to Hensel, which preceded the valuation 
theoretic approach that is now the standard one (see Chapter 9, p. 548). 


Let p be a prime in Z and let (p*) be the principal ideal of multiples of p% for k = 1, 2,.... 
Let ®, be the congruence determined by p*: a®,b means a = b (mod p*). Then z/®, = 2/(p*), 


the ring of residues modulo p*. We have ©, c ®, if / > k and ND, = ly. We can form the 
inverse limit of the finite rings 2/(p"), which we call the ring Z, of p-adic integers. An 
element of Z, is a sequence of residue classes (or cosets) (a, + (p), an + (p*), a3 + (p>),...) 
where the a; are integers and for / > k, a, = a, (mod p*). We can represent this element by the 
sequence of integers (a), d>,...) where a, = a; (mod p*) for k < l. Then two such sequences (ai, 
d>,...) and (bj, b5,...) represent the same element if and only if a, = b, (mod p*), k = 1, 2,.... 
Addition and multiplication of such sequences is component-wise. If a € Z, we can write a = 
ro trip +... +r p” where 0 <r; < p. Then we can replace the representative (a), az, ...) in 


which a, = a, (mod p*) if k < l by a representative of the form (ro, Fo + rP, Fo + rip + ryp?,---) 
where 0 < r; < p. In this way we can associate with any element of Z, a uniquely determined p- 
adic number 


(19) ro +r p+r t+" 


where 0 < r; < p. Addition and multiplication of these series corresponding to these 
compositions in Z, are obtained by applying these compositions on the r; and “carrying.” For 
example, if p =5, we have 


(142.54+3.57 + ++:)+(34+3.54+2574 ++) = 440.54+1.54 
(142,543.57 +) (343.542.5274 ++) = 344.54+3.54 


EXERCISES 


— 


. Let / be a pre-ordered set, I the associated category defined as usual. Let a ~ Ay, Pag: Ay 
— Ap fora < p anda ~ Ba, Wap: Ba — Bz be functors from I to a category C (as at the 
beginning of the section). Call these F and G respectively. A natural transformation À of 
F into G is a map a ~ 4, where 2, : A, > Ba such that for a < P, Wapha = ApPap. Verity 


ap a 

that one obtains a category C! by specifying that the objects are the functors from I to C, 
and if F and G are two such functors, define hom (F, G) to be the class of natural 
transformations of F to G (which is a set). Moreover, define lp as on p. 25 and the 
product of natural transformations as on p. 25. (This completes the definition of C!.) If A 
€ ob C, let F, e C! be defined by the following: a ~ A, Pap = 14 for a < p. Define a 
functor from C to C! by A ~ F, and if f: A — B, then f is mapped onto the natural 
transformation of the function F4 into the functor Fg, which is the morphism f from A, = 
A to B, = B for every a € J. Show that a direct limit "™ (4,, Pap) can be defined as a 
universal from F (as object of C) to the functor from C to C! that we have just defined. 


2. Let J be the ordered set of positive integers (with the natural order) and for k € J, let 4; 
=I be regarded as a semigroup under addition. For k < l let g, be the map x ~ 2'*x of 
A; to A,. Note that this is a homomorphism of semigroups and mk = PKPmb Pu = 1 Ap 
Show that an inverse limit does not exist in the category of semigroups for this functor 
from I to the category. 

3. Let Z, be the ring of p-adic integers and represent the elements of Z, as sequences a = 
(aj, A, A3,...), A, E Z, a, = ay (modp*) if k < I (as above). Show that a (a, a, a,...) is a 
monomorphism of Z in Z,. In this way Z is imbedded in Z, and we may identify a with 
(a, a, a,...). Show that Z, has no zero divisors. Hence this has a field of fractions Q,. 
This is called the field of p-adic numbers. 

4. Assume p # 2 and let a be an integer (element of Z) such that a = 0 (mod p) and the 
congruence x° = a (mod p) is solvable. Show that x’ = a has a solution in Z,.. 


5. Show that the units of Z, are the elements that are represented by sequences (a), az,...) 
with a, = a; (mod p*), if k < l, such that a, #0 (mod p). 


2, X 
6. Leta € Z, a #0 (mod p), and put u = (a, a”, aP ,...). Show that this represents an element 
of Z, and uP! = 1 in Z„. Hence prove that Z, contains p — | distinct roots of 1. 


2.6 ULTRAPRODUCTS 


The concept of ultraproducts of algebras and of more general structures that involve relations 
as well as compositions plays an important role in mathematical logic, where it was first 


introduced. It has turned out to be a useful tool in algebra as well. For the sake of simplicity, 
we confine our attention to ultraproducts of Q-algebras. We need to recall some results on 
filters in Boolean algebras that were given in BAI. Although these will be required only for the 
Boolean algebra of subsets of a given set, it seems worthwhile to discuss the results on filters 
for arbitrary Boolean algebras, especially since it is conceptually simpler to treat the general 
case. 

We recall that a Boolean algebra is a lattice with least and greatest elements 0 and 1 that is 
distributive and complemented. The first of these conditions is that aa (bv c) =(aa b)v (aa 
c) and the second is that for any a there exists an element a’ in the lattice such that a v a’ = 1 
and a a a'= 0. We refer the reader to BAI, pp. 474480, for the results on Boolean algebras 
that we shall require. In the Boolean algebra A(S) of subsets of a set S, Av B and Aa B are the 
union and intersection respectively of the subsets A and B of S, 1 = S, 0 = © and for any A, A’ is 
the complementary set of A in S. We recall that a filter in a Boolean algebra B is a subset F that 
is closed under « and contains every b > any u € F. A filter is proper if and only if F £ B, 
which is equivalent to 0 ¢ F. Anultrafilter F is a proper filter that is maximal in the sense that 
it is not properly contained in any other proper filter of B. A filter F is an ultrafilter if and only 
if itis proper and for any a € B either a or its complement a’ (which is unique) is contained in 
F. 

A Boolean algebra can be made into a ring in two ways. In the first of these one defines a + 
b = (a^ b’) v (a'a b), ab =a ^ b, and takes 0 and | to be the given 0 and 1 (BAI, p. 478). In 
the second, one dualizes and takes the addition composition to be a + 'b = (av b^ a (a'y b) 
and the multiplication a-'b = a v b. The zero element is 0’ = 1 and the unit is 1' = 0. If one does 
this, it turns out that a filter in B is simply an ideal in the Boolean ring (B, +,0', 1’). As a 
consequence of this we have the following special case of Corollary 2 to Theorem 2.5. 


COROLLARY 3. (to Theorem 2.5). Any proper filter F in a Boolean algebra can be 
imbedded in an ultrafilter. 


Now let {A4,} be a set of Q-algebras indexed by a set Z and let F be a filter in the power set 
Pil): If S|, S2 € F, then S; N S2 € Fand if S € F and T > S, then T € F. Consider []A,. In this 
algebra we define a relation ~p by a ~ p if the set of indices 


(20) lap = {&EI|a, = b,j EF. 


We claim that this is a congruence on Į] J4. It is clear that this relation is symmetric, and it is 
reflexive since J € F. Now suppose a ~ pbb and b ~;c so I, p and I, ¿ E€ F. Since Iz p N Ip c 
C la, , and F is a filter, it follows that 7, 
relation on | [4,. To show that it is a congruence, we must show that if we Q(n), n = 1, and a), 
a, BD,..., B™ © TTA, satisfy a® ~ pb® 1 <i <n, then wa®...a™ ~ p wb...b™. By 
assumption, J, ,0 € F for 1 <i<n. Then V0 ,@) € F and if a is in this set, then 


_¢ E F. Hence a ~ pe. Thus ~ is an equivalence 


wa =a = ob) b™, 


Thus NLO C loa.. a™ ob,,.6- Hence the latter set is in F and consequently F = ~ p is a 
congruence on the algebra []4,. We now define the quotient algebra [[A4/ F to be the F- 
reduced product of the Q-algebras A,. If F is an ultrafilter, then [[4,/¥% is called an 
ultraproduct of the A, and if every A, = A, then we speak of an u/trapower of A. 

Ultraproducts owe their importance to the fact that any “elementary” statement valid for all 
of the A, 1s valid for every ultraproduct of the 4,. We refer the reader to books on model theory 


for a precise statement and proof of this result. An excellent reference is the chapter 
“Ultraproducts for Algebraists” by Paul Eklof in Handbook of Mathematical Logic, ed. by Jon 
Barwise, North Holland Company, 1977. An illustration of the result is 


PROPOSITION 2.1. Any ultraproduct of division rings is a division ring. 


Proof. The statement that D is a division ring is that D is a ring in which 1 + 0 and every a £ 
0 in D has a (two-sided) multiplicative inverse. (This is an example of an elementary 
statement.) Suppose the D,, a € J, are division rings. We wish to show that any ultraproduct 


[|D,/¥ determined by an ultrafilter F is a division ring. Let a be an element of [[D, such that 
a = ag + 0. Then the set J, = J, 9 = {aella, = 0p } € F. Since F is an ultrafilter, the 


complementary set /’, = {ae/|a, # Op } is in F. Define b by b, = a,l ifa e I,', b, = 0 
otherwise. Then the set of a such that a,b,= 1p = bya, is I’, and so belongs to F. Hence ab = | 
= ba in [[D,/F and D is a division ring. O 


The same argument shows that any ultraproduct of fields is a field. It should be noted that 
the proof makes strong use of the fact that F is an ultrafilter and not just a filter. In fact, the 


argument shows that [[D,/¥ f~" F a filter is a division ring if and only if the complement of any 


set S of indices that is not contained in F is contained in F, hence, if and only if F is an 
ultrafilter. 


Another result whose proof can be based on ultraproducts is 


PROPOSITION 2.2 (A. Robinson). Let R be a ring without zero divisors that is a subring 
of a direct product | |D, of division rings D,. Then R can be imbedded in a division ring. 


Proof. (M. Rabin). Let J be the index set. For each a € RC [JD, let l', = {a € lla, # OD}. 
Then /', # Ø if a # 0. Let B = {7',| a £ 0}. If a)..., a, are non-zero elements of R, then 
„ E B. Evidently, 1), 4, C (Y la, Thus the set B is a filter base in P(D) 


in the sense that Ø ¢ B, and the intersection of any finite subset of B contains an element of B. 
Then the set G of subsets of Z, which contain the sets 7', of B, is a filter in A(/) not containing 


©. By Corollary 3 to Theorem 2.5, G can be imbedded in an ultrafilter F of AV). Put D = 


ajaz. .An FO sO T'a a a 


[[D,/7. By Proposition 2.1 this is a division ring. We have the nomomorphism a ~ ag of R 
into D. We claim that this is a monomorphism. Otherwise, we have an a # 0 in R such that the 
set [, = {a € lla, =Op } € F. On the other hand, 7', = {a € lla, # Op } € B CF and J", is the 
complement of /, in Z. Since F is a filter, it contains Ø = L{W "e This contradicts the fact that F 


is an ultrafilter. Hence a ~ @g is a monomorphism and this provides an imbedding of R in a 
division ring. O 


EXERCISES 


1. Use exercise 2, p. 70 and Proposition 2.2 to prove that Z can be imbedded in a field. 


2. If a is an element of a Boolean algebra, the subset F, = {xlx = a} is a filter called the 
principal filter determined by a. Let I be a set, S a subset, F, the principal filter 
determined by S. Show that if F, is the congruence in [[,.;A4, determined by F,, then 
[A/F s = [pes dp- 

3. Show that an ultrafilter in P1), I an infinite set, is not principal if and only if it contains 
the filter consisting of the complements of finite subsets of J. 


2.7 FREE QO-ALGEBRAS 


Let Q= Ur _,Q(n) be a given set of operator symbols where Q(7) is the set of n-ary symbols, 
and let X be a non-vacuous set. Now let Y be the disjoint union of the sets Q and X and form the 
disjoint union W(Q, X) of the sets Y™, m > 1, where Y™ is the set of m-tuples (Y4, Y2 --. Ym), Yi 
e Y. To simplify the writing, we write y1y> ... Ym for (V1, Y2» --- Ym). This suggests calling the 
elements of W(Q, X) words in the alphabet Y. We define the degree of the word w = y1y2 ... Ym 
to be m. It is useful also to introduce a notion of valence of a word according to the following 
rule: The valence v(x) = 1 ifx € X, (@) =1-—nif@ € Q(n), and 

m 

00017 Yn) = D0) 

1 
for y; € Y. For example, if w = ©;0@,x,@3xx3«4°) where the superscripts on the symbols 
indicate their arities, then w has degree 7 and valence —2. 

We now introduce the juxtaposition multiplication in W(Q, X) by putting 


(Ya Vm) nN) = Va Ve 


It is clear that this product is associative, so no parentheses are required to indicate products 


of more than two words. 

Next we shall use this associative product to define an Q-algebra structure on the set WQ 

X). If we Q(0), we define an element of Y to be the element corresponding to œ in WQ, X). 
Now let we Q(n), n = 1, and let w), w>,...,.w, E WQ, X). Then we define the action of Q on 
the n-tuple (w1, W>,..., W„) as the word ww ,w>...w,. Evidently, these definitions make W(Q, X) 
an Q-algebra. We now let F(Q, X) be the subalgebra of the Q-algebra W(Q, X) generated by 
the subset X and we shall show that this is a free Q- algebra generated by X in the sense that 
any map of X into an Q-algebra A has a unique extension to a homomorphism of F(Q, X) into A. 
We shall establish first the following criterion for an element to belong to F(Q, X) and a unique 
factorization property. 
LEMMA. 4 word w e F(Q, X) if and only if its valence is 1 and the valence of any right 
factor w' of w (w = w"w' or w' = w) is positive. The subset of F(Q, X) of elements of degree | 
is X U Q(0). If w e F(Q, X) has degree > 1, then w begins with an we Q(n), n > 1, and w can 
be written in one and only one way as œw; w ... w, where w; € F(Q, X). 


Proof. The statement on the elements of degree 1 in F(Q, X) is clear. It is clear also that v(w) 
= 1 for these elements and w is the only right factor of w of degree 1 so the conditions on 
valence hold for w. Now let w e F(Q, X) have degree > 1. It is clear from the inductive 
procedure given on p. 59 for producing the elements of the subalgebra of an Q-algebra 
generated by a subset that w has the form w = www, ... w, where we Q(n) and every w; 


e F(Q, X). Using induction on the degree, we may assume that every w; satisfies the stated 
conditions on the valences. It then follows that w = ww,w, ... w, satisfies the valence 


conditions. Conversely, let w be any word that satisfies the valence conditions. If the degree of 
wis 1, then either w is anx € X or w= we Q(0). In either case, w € F(Q, X). Now suppose the 
degree ofw is greater than 1. Then the valence conditions imply that w begins with an œe Q(n), 
n= 1. Thus w= @yj)2 ... Ym where we Q(n), n > 1, and the y; € Y. For any i, 1 < i <n, let u; be 
the right factor of u = y1y2 ... Ym of maximum degree having valence i. For example, let 


w = 0," x, xX x03! x6x604 ws” 

where the superscripts give the arities of the operator symbols to which they are attached. Then 
u = X4X5X307? x04 x5x 60405 and if we write the valence of the right-hand factor, which 
begins with a symbol above this symbol, then we have 


(2) 2 (0)) (0) 


4° 3 2 Ds A (4$ 


Hence, in this case, u, = eee ain Uy = X3Uy, U3 = X32, U = Uy = XU3. In 
general, we have u, = u = y1y2 ... Ym and u; is a right factor of u; for 2 <i < n. Then we can 
write uy = W, Uy = Wy_1Uy, U3 = W,-9U>... Where the w, satisfy the valence conditions. Thus we 
have w = @y1y2. -Ym = OW] W3 ... W,. Since the degree of every w;is less than that of w, using 
induction on the degree, we may assume that w; € F(Q, X). Then w = wwyw>...w, E F(Q, X). 


This proves that the valence conditions are sufficient to insure that an element is in F(Q, X). 
Now suppose we have a second factorization of w as w= ww',w', ... w', where w;’ € F(Q, X). 


Then the valence of w’, is 1, of w',_,w', is 2, of w',.w',.;w', is 3, etc. Hence the definition of 
the u; implies that the degree deg w;'w';,) ... W, < deg ww; +)... W,, 1<i<n. If equality 
holds for all 7, then w’; = w4, w'> = W,..., Ww, = Wp. Hence assume degw7 ... w", < degw; ... W, 
for some i and let 7 be minimal for this relation. Evidently i > 1 and deg w’; ... w’, = deg w; 
p++ Wp SOW)... Wh = Wz]... Wp Then deg w'y ... w-, > deg w; ... w; and w’_, = W-Z 
where z is a word. Since v(w’;_,) = 1 = v(w,_,) we have v(z) = 0, which contradicts the valence 
conditions that must be satisfied by w’ ,_, € F(Q, X). 0 


We can use this to prove the “freeness” property of F(Q, X): 


THEOREM 2.9. Let fbe a map of X into an Q-algebra A. Then f has a unique extension to 
a homomorphism Í of F(Q, X) into A. 


Proof. Ifx e€ X, we put f (x) = f(x) and if œ € Q(0) so w e F(Q, X), then we let f (@) = wy. 
The remaining elements of F(Q, X) have the form @w,w, ... w,, where the w; € F(Q, X, 
we Q(n), n> 1. Suppose we have already defined f (w’) for all w’ of degree < deg œ Wi... Wye 
Then / (w) is defined and we extend the definition to ww, ... w, by f (aw, ... w,) = of (w)) 
... J (w„). This inductive procedure defines / for all F(Q, X). It is clear also that f is a 


homomorphism. Since X generates F(Q, X), / is unique (see exercise 3, p. 66). D 


We can express the result of Theorem 2.9 in categorical terms. We have the forgetful functor 
F from the category Q-alg to Set mapping any Q-algebra ir its carrier (underlying set) and 
algebra homomorphisms into the c respcing set maps. Then it is clear from the definition of 
universals (p. 42 that 7 is a non-vacuous set, then the pair (F(Q, X), i) where 7 is the injection 
of X into F(Q, X) constitutes a universal from X to the functor F. The fact that (F(Q, X), i) is a 
universal implies its uniqueness in the usual sense. 


2.8 VARIETIES 


The concept of a free Q-algebra F(Q, X) permits us to define identities for Q-algebras. Let 
(w1, w2) be a pair of elements in F(Q, X) for some set X. Then we say that an Q-algebra A 


satisfies the identity w} = w, (or w; = w, is a law in A) if Aw) = Aw) for every 
homomorphism f of F(Q, X) into A. If S is a subset of F(Q, X) x F(Q, X), then the class of Q- 
algebras satisfying the identities w} = w, for all (w;, w2) € S is called the variety of Q- 
algebras V(S) defined by S (or the equational class defined by S). For example, let o®, wo, 
w® be respectively binary, unary, and nullary operator symbols, Q = {o@®, ow, @}, X = 


{X1, X2, X3}; then the class of groups is defined by the following subset of F(Q, X) x F(Q, X): 


(2)-,(2)y 


(wwx xx 0x, xx) 


(w! Dy 1X1 ) 


(0) 


(wxw ,Xı) 


j 
(ox wx, w) 


(2) o, 


(w XX1 0 


For, the statement that A is an Q-algebra for the indicated Q is that A is equipped with a binary, 
a unary, and a nullary composition. If we denote the effect of these in A by ab, a', and 1 
respectively, then the statement that A is in V(S) for S, the set of five elements we have listed, 
amounts to the following laws in A: (ab)c = a(bc), la = a, al = a, aa! = 1, a ‘a = 1. 
Evidently, this means simply that A is a group, so V(S) is the variety of groups. In a similar 
manner one sees that the classes of monoids, of lattices, of rings, of commutative rings, and of 
Boolean algebras are varieties. 

It is easily seen that in defining varieties there is no loss in generality in assuming that X is 
a countably infinite set Xo = {x, X>,...}. At any rate, we shall do this from now on. Let V(S)) 


be the variety of Q-algebras defined by the subset S of F(Q , Xo) x F(Q, Xo). It is clear that if A 


€ V(S), then any subalgebra of A is contained in V(S). Moreover, every homomorphic image 4 
of A is contained in V(S). To see this we observe that any homomorphism / of F(Q, Xo) into A 


can be lifted to a homomorphism f of F(Q, Xo) into A in the sense that if à is a surjective 
homomorphism of A onto 4, then there exists a homomorphism f: F(Q, Xo) — A such that f= 
Af. To define such an f we choose for each i = 1, 2,... an element a; € A such that A(a;) = F (x) 
and we let f be the homomorphism of F(Q, Xp) into A extending the map x; a; of Xo into A. Then 
Afix) = Aa) = f (x). Since Xp generates F(Q, Xo), we have Af = f for the homomorphisms 47 
and f of F(Q, Xp) into A. Now suppose (w), w2) € S and let f be any homomorphism of F(Q, 
Xo) into A, fa homomorphism of F(Q, Xo) into A such that 4f = f . Then f (w,) = Afw) = Af(w>) 
= F (wy). Thus A satisfies every identity w, = w, for (w4, W2) € Sand hence A € (S). 

We note next that if {4,|ae/} is an indexed set of algebras contained in V(S), then []4, € 
V(S). This is clear since if p, denotes the projection of A = [JA, into A, and f is a 
homomorphism of F(Q, Xo) into A, then p,f(w,) = p,f(w2) for every (w1, w2) € S. Then Aw) = 
flw) and hence A € (S). 


Any variety V(S) defines a full subcategory V(S) of Q-alg. We shall now consider some of 
the important properties of such categories and we prove first 


PROPOSITION 2.3. 4 morphism in V(S) is monic if and only if it is an injective map. 


Proof. The proof is an immediate generalization of the one given on p. 17 in the special case 
of the category of rings. If fis an injective homomorphism of A into B, 4,B e V(S), then f is 
monic. Now suppose f:A — B is not injective. We have the subalgebra © =f f of A x A (the 
kernel of f) and ® e (S). Let p and p' be the restrictions to ® of the projections (a, a’) ~ a 
and (a, a’) ~ a’. Since (a, a’) € Ọ is equivalent to f (a) = f (a') we have fp = fp’. On the other 
hand, since fis not injective, we have a pair (a, a’) such that f(a) = f(a’) and a + a’. Then (a, a’) 
€ ® and p(a, a' + p'(a, a). Since fp = fp’, this shows that fis not monic in V(S). D 


We recall that in the category of rings, epics need not be surjective. Hence the analogue for 
epics of Proposition 2.3 is not valid. 


If C is a subcategory of Q-alg and X is a set, we call a universal from X to the forgetful 
functor from C to Set a free algebra for C determined by X. We have the following extension 
of Theorem 2.9 to varieties: 


THEOREM 2.10. Let V(S) be the category defined by a variety and let X be a non- 
vacuous set. Then there exists a free algebra for V(S) determined by X. 


Proof. Consider the free algebras F(Q, X) and F(Q, Xo) where Xo = {x1, X2,...}. Then S is a 
subset of F(Q, Xo) x F(Q, Xo). If g is a homomorphism of F(Q, Xo) into F(Q, X) and (wy, w2) 
e S, then (g(w,), o(w2)) € F(Q, X) x F(Q, X). Let ®(S) denote the congruence on F(Q, X) 
generated by all the elements (g(w,), (w2)) obtained from all the homomorphisms ọ and all of 
the pairs (Ww), w) € S. Put FQ, X = FQ, X)/P(S) and let v be the canonical 
homomorphism a ~ 4 gs) of F(Q, X) onto Fy) (Q2, X). We claim that Fy5)(Q, X) together with 
the restriction i of y to X constitutes a free V(S)-algebra for the set X. We show first that Fy(s) 
(Q, X) e WS). Let ? be a homomorphism of F(Q, Xo) into Fys)(2, X). Then as we showed 
above, there exists a homomorphism ”: F(Q, Xo) —> F(Q, X) such that ? = v?. If (w), w2) € S, 
((pw,), G(W)) € B(S), so ve(w,) = ve(wy). Thus P(w) = P(w»). Since this holds for every 
(w1, w2) € Sand all homomorphisms ¢ of F(Q, Xo) into Fyig(Q, X), we see that Fyys)(Q, X) € 
V(S). Next suppose g is a map of X into an algebra A €e V(S). Then g can be extended to a 
homomorphism 9 of F(Q, X) into A. If ọ is a homomorphism of F(Q, Xo) into F(Q, X), then 9@ 
is a homomorphism of F(Q, Xo) into A. Hence, if (w;, w2) € S, then 9p(w,) = 9e(w>). Then 
(g(w,), ø(w2)) is in the kernel ker 9. Since this holds for all (w,, wə) € S and all 
homomorphisms p of F(Q, Xo) into F(Q, X), the congruence ®(S) c ker 9. Hence we have a 
unique homomorphism f of Fy(g)(Q, X = F(Q, X)/®(S) into A such that 


F(2,X) m Frys; (Q.X) 


g 


is commutative. Then for i = v\X we have the commutative diagram 


x : Frys; (2,X) 


A 


Since X generates F(Q, X), i(X) generates Fys)(Q, X). Hence fis unique and (Fy(s)(Q, X), i) is 
a free algebra for V(S) determined by X. 0 


COROLLARY. Any algebra in a variety is a homomorphic image of a free algebra. 


Proof. Let X be a set of generators for the given algebra A € V(S). For example, we may take 
X = A. Then we have a homomorphism of Fy5)(Q, X) into A whose image includes X. Since X 


generates A, the image is A. Thus we have a surjective homomorphism of F'y5(Q, X) onto A. O 


The result we noted before—that if {A,|aeJ/} is an indexed set of algebras contained in a 
variety V(S) then | [4,€ V(S)—implies that products exist in V(S) for any indexed set of objects 
in this category. This holds also for coproducts: 


THEOREM 2.11. Coproducts exist in V(S) for any indexed set {A,|a € I} of algebras in 
V(S). 


Proof. We reduce the proof to the case of free algebras in (S) by showing that if a coproduct 
exists for a set of algebras {Aa|ael}, A,€V/(S), then it exists for every set {4,|a € I} where A, 


is a homomorphic image of A, Let {4, i,} = UA, in V(S) and let 7, be a surjective 
homomorphism of A, onto 4,, a € I. Let ©, be the kernel of n, and let 1,0, denote the subset 
of A x A of elements (i,(a,), i,(b,)) for (ay, Da) E Oy. Next, let © be the congruence on A 
generated by U iOa and put A = 4/0, i’, = vi, Where v is the canonical homomorphism of A 
onto A = A/O. Then i’, is a homomorphism of A, into A and if (a ba) E Oy (ig(dg), ig(b,)) € 
O, so vi,,(a,) = vi,(b,). Thus ©, is contained in the kernel of i’, = vi,; consequently we have a 
unique homomorphism i ,: 4, — A, making the diagram 


Aa i. A 
Tha v 
A. z A 


commutative. Now A e V(S) since it is a homomorphic image of A e V (S). 
We claim that {4, i,,} = uA,. Let B € V(S) and for each a € J let / , be a homomorphism of 


A, into B. Then f, = / „a is a homomorphism of A, into B whose kernel contains ©,. Since 
{A, i,} = UA,, we have a unique homomorphism /: A — B such that fi, = fọ a € I. If (a,, ba) 
Ee ©, then 7,(a,) = 7,(b,) and f(a.) = fina.) =f.n(b.) = 56), Thus fi,(a,) = fi,(b,), 80 i, Og is 
contained in the kernel of f. Since this holds for all a and since © is the congruence on A 
generated by iO, we see that © is contained in the kernel of f. Hence we have a unique 
homomorphism / : A — B such that 


~] 


A 


is commutative. Now we have the diagram 


in which the rectangle and the triangles 4AB and 4,4B are commutative. Now the “diagram 
chase” J isha = via = fia =F afa and the fact that n, is surjective, hence epic, imply that fi, = 
J œ a € I. To prove the uniqueness of f we observe that f : A — B satisfies f = Jip a € I, 
and we define f = J v, then the rectangle and the triangles 44B and 4,AB in the foregoing 
diagram commute. Then the triangle 4,4B is commutative, since f, = J ga = S iola = Í via = 
fia Since {A, i,} =A, this implies that fis unique. Since v is surjective, / is unique. Hence {4 
, tf = A,. 

The result just proved and the fact that every A, € V(S) is a homomorphic image of a free 
algebra in V(S) imply that it suffices to prove the theorem for free algebras A, = Fy5)(2, Xa). 
For this purpose we form A = Fy), X) where X = Ux, the disjoint union of the X,. We 
denote the maps X — A, X, — A, given in the definition of the free algebras in V(S) 
indiscriminately by į and let i, be the homomorphism 4, — A corresponding to the injection of 
X,, into X. Hence i iX) = i(x,). Now suppose we have homomorphisms /,..4, — B where B € 
V(S). Let g be the map of X into B such that g(x,) = fa i(x,), a € I. Then we have a 
homomorphism f of A into B such that fi(x) = g(x), x e X. Then fi(x,) = 2&4 = foi) = 
fi,i(x,,). Since the elements i(x,) generate 4,, we have fi, =f, a € I. Since the elements i(x), 
Xg E X,a€ I, generate A, fis unique. Hence {A, i,} = A,. O 


The preceding construction of UA, = {A, ią} in a variety shows that A is generated by the 
images i,(A,). This can also be seen directly from the definition of a coproduct of algebras. Of 
particular interest is the situation in which the 7, are injective. In this case we shall call u A, 
the free product. A useful sufficient condition for the coproduct to be a free product is given in 


PROPOSITION 2.4. Let uA,be a coproduct of Q-algebras in a category C. Then UA, is a 
free product if every A, has a one-element subalgebra {&,}. 


Note that this is applicable in the category of monoids and of groups, but not in the category 
of rings. 


Proof. If A is any Q-algebra, the map of A into Ag sending every element of A into eç is a 
homomorphism. Let fg be this homomorphism for 4 = A, into A, if a # p and let fgg = 1 4p. 
Applying the definition of a coproduct, we obtain a homomorphism fg: A = 4, — Ap such that 
fap = Joia & € I. Since fgg is injective, iz is injective. Thus every ig is injective and A, is a free 
product. 0 


EXERCISES 


— 


. Give an example in the category of rings in which the coproduct is not a free product. 


2. Let F-(Q, X) be a free algebra for a category C of Q-algebras determined by a set X. 


Show that if à is a surjective homomorphism of A into A where A € ob C and f is a 
homomorphism of Fc {Q, X) into A, then there exists a homomorphism f of F¢(Q, X) into 


A such that f= ìf. 


3. Let Fe(Q, X) be as in exercise 2. Prove the following: (i) Fe(Q, X) is generated by i(X) 
(i the canonical map of X into F¢(Q, X)). (11) If Y is a non-vacuous subset of X, then the 
subalgebra of Fc(Q, X) generated by i(Y) together with the restriction of 7 to Y regarded 


as a map into the indicated subalgebra constitutes a free C algebra determined by Y. (111) 
Let X’ be a second set, F-(Q, X’) a free C algebra for X. Show that if |X| = |X’|, then 


F(Q, X) and F¢(Q, X’) are isomorphic. 
4. Show that if (S) contains an algebra A with |4| > 1, then it contains algebras of 


cardinality exceeding any given cardinal. Use this to prove that for any X the map i of X 
into Fys)(Q, X) is injective if V(S) contains non-trivial algebras. 


5. Show that if the 4,¢ V(S) for aeJ/, a pre-ordered set, and lim A, exists, then lim A, € 
V(S). Show also that if J is directed, then lim A, € (S). Show that any ultraproduct of 
A, E V(S) is contained in V(S). 


6. Let FM” be the free monoid defined by X = {x}, x5, ..., x,} as in BAI, p. 67. FMM 
consists of 1 and the monomials (or sequences) x;,x;,...x;, where 1 < i; < r, and the 
product is defined by juxtaposition and the condition that 1 is a unit. Form the free Z- 


module with FM” as base and define a product by 


(Lriw) (Lrjw)) = Lrirjwiw; 


for r,r;'€ Z, wp wi, € FM”. This defines a ring 2{X} = Z/FM™] (exercise 8, p. 127 of 
BAI). Show that Z2{X} together with the injection map of X into Z{X} constitutes a free 
ring determined by X. 


2.9 FREE PRODUCTS OF GROUPS 


We shall now have a closer look at the coproduct of an indexed set {G,|A € J} of groups G,- 
By Proposition 2.4, Q G, is a free product. In group theory it is customary to denote this group 
as [[*G, and to identify G, with its image 7,(G,) in [|*G,. When this is done we have the 
following situation: []*G, contains the G, as subgroups and is generated by UG,. Moreover, if 
H is any group and f, is a homomorphism of G, into H for every a € J, then there is a unique 
homomorphism f of [[*G, into H such that f| G; =/,, a € L. 


If G is any group containing subgroups G,, a € I, and generated by UG., then every element 
of G is a product of elements of the G,. This is clear, since the set G’ of these products contains 


UG, and thus contains 1. Moreover, G' is closed under multiplication and taking inverses. 
Hence G’ is a subgroup, so G’ = G. It is clear that if we take a product of elements of the G,, 


then we may suppress the factors = 1 and we may replace any product of elements that are in 
the same G, by a single element in G,. In this way we may replace any product by one that 


either is 1 or has the form x,x, ... x, where x; # 1 and if x; E G,, thenx,,, £ Ga Products of 
this form are called reduced. We shall now show that in the free product [[*G, anY two 


reduced products that look different are different. For this purpose we shall give an alternative, 
more direct construction of [|[*G,. There are a number of ways of doing this. The one we have 


chosen is a particularly simple one due to van der Waerden. 

We start from scratch. Given a set of groups {G,EZ}, let G’, be the subset of G, of elements 
+ 1. Form the disjoint union S of the G', and {1}. In this set we can identify G, with G'a U 
{1}. Then we shall have a set that is the union of the G,; moreover, Gy N Gz= 1 if a # p. Now 
let G be the set of reduced words based on the G,. By this we mean either 1 or the words xx, 
.. X, Where x; # 1 and ifx; € G,, thenx,,, £ Gaus 1 <i<n-— 1. We shall now define an action of 
G, on G(cf. BAI, pp. 71-73) by the following rules: 


(21) Ilw=w forany weG. 


Ifx #1inG,, thenxl =x and ifx, ... x, is a reduced word £ 1, then 
xxx, E x, éG, 


(22) XX1°°°Xy_ = (XX,)XQ°°°X, If x, EG, and xx, #1 


X, X, Wf x,EG, and xx, = 1. 
In the last formula, it is understood that if n = 1, so that an empty symbol results, then this is 
taken to be 1. 

Our definitions give xw € G for x € G,, w € G. We shall now verify the axioms for an 
action: lw = w and (xy)w = x(yw) for x, y e G, and w €e G. The first is the definition (21) and 
the second is clear if either x, y, or wis 1 or ifx, ¢ G, It remains to consider the situation in 
which x, y, x; E Ga x #1, y# 1. We distinguish three cases: (a) yx, £ 1, xx, £ 1; (A) yx, #1, 
xyx, = 15 (y) yx, = 1, xyx, # 15. Note that yx, = 1, xyx; = 1 is ruled out, since x = 1 in this case. 


(a) Here (xy)x, ... Xp = (xyx))x2 ... x, and x(x]... Xp) = X(VX Xo... Xy = (AVX) «.. Xp, 
SO (XV )xX1 <<. Xy Z XVX] ... Xp). 

(P) Here (xy)x, ... Xy =X... X, and x(x]... Xp) = XX1 )X2 ... Xp =Xq «.. Xp, SO (Xy)X] ... 
Xy Z X(YX] oes Xp). 

(y) Here (xy)x; ... Xy = (XYX1)X2 -.. Xy = AX... X, and XX, ... Xp) = XX2 «.. Xp) EXX ... 
X,, SO again (xy)xX] ...X, = XVX] ... Xy). 

If T, denotes the map w xw for we G and x € G,, then we have 7,7, = Ty and T; = 1. 
Hence T, is bijective with inverse 7\-1 and x ~ T, is a homomorphism of G, into the group 
Sym G of bijective transformations of the set G. Since T, 1 =x, x ~ T, is injective, so G, is 
isomorphic to its image 7(G,), which is a subgroup of Sym G. Let G* denote the subgroup of 
Sym G generated by Ur al): Let T,, ... Iy, be a reduced product of elements of Ur (0) 
where no x; = 1. The formulas (21) and (22) give T,, ... T}, 1 =x, ... Xp. Then 7, ... T, #1 
and if T, ... T, is a second such product, then 7, ... 7, =T,, ... J, implies that m = n and 
x;=y;, 1 <i< n. We can identify the 7(G,) with G,. Then we have proved the first conclusion 
in the following. 


THEOREM 2.12. Let {G,\ael} be a set of groups. Then there exists a group G* 
containing the G, as subgroups that is generated by UG, and has the property that if x, ... 


X_ ANA Yy ... Vip Xi Y; € UG., are reduced products relative to the G,, then x, ... Xp, =... Ym 
implies that m = n and x; = y;, 1 <i < n. Moreover, if G* is any group having this property 


and i, is the injection of G, in G, then {G*, i,} is a free product of the G,. 


Proof. To prove the last statement we invoke the existence of the free product []*G,, which is 
a consequence of Theorem 2.11, and Proposition 2.4. By the defining property of [|[*G, we 
have a homomorphism y of | [*G, into G*, which is the identity map on every G,. We claim 7 
is an isomorphism, which will imply that {G*,i,} is a free product of the G,. Since the G, 
generate G*, y is surjective. Now let a be any element £ 1 in | [*G,. Then we can write a in the 
reduced forma = x, ... x, where the ;# 1 and successive x; are in different G,. Then (a) = x, 
... X, # 1 in G*. Hence ker y (in the usual sense) is 1 so 7 is injective. O 


The free product can be used to give an alternative construction of the free group FG(X) 
determined by a non-vacuous set X, which is more explicit than the ones we have considered 
before (in BAI or in the previous section). With each x € X we associate the infinite cyclic 
group <x?) of powers x*, k = 0, + 1, + 2,..., and we construct the free product []*,-y <x? of the 
groups <x? as above. This contains the subgroups <x? and has the properties stated for the G, 
in Theorem 2.12. Thus []*,- <x? consists of 1 and the products x ‘ix, ... x,n where the x; € 
X, k, =+1, +2, ..., and x; #x;,, for 1 < i < n—1. Moreover, these products are all #1 and two of 
them are equal only if they look alike. Evidently []*,- Sx contains the subset of elements x!, 


which we can identify with X. Now suppose fis a map of X into a group H. Then for each xe X 
we have the homomorphism f. of <x? into H sending x ~ f(x). Hence, by the property of []* ex 
Cx?) as free product of its subgroups <x}, we have a unique homomorphism / of []*,<<x> into 
H such that for every x the restriction of f to <x> is f.. Evidently, this means that f(x) = f(x) 
and so we have a homomorphism of [ ]* < $x? into H, which extends the given map f of X into 
H. It is clear also that there is only one such homomorphism f. Hence []*,-<x> is a free 
group determined by the set X. The uniqueness of free groups in the category sense permits us 
to state the result we have obtained from this construction in the following way: 


THEOREM 2.13. Let X be a non-vacuous set, (FG(X),i) a free group determined by X 
where i denotes the given map of X into FG(X). Then i is injective, so we may identify X with 
i(X). If we do this then we can write any element #1 of FG(X) in the reduced form xfx}? ... 


x,» where the k,=+ 1, +2,... and consecutive x; are different. Moreover, any element of the 


l ky 


form indicated is #1 and if y,!\y5!2 ... y,/m is a second element in reduced form, then x)" x, 


a.. X_kn = yim implies m =n, y; = x;k; =l, 1<i<n. 


The usefulness of the normal forms of elements of free products and free groups given in 
Theorems 2.12 and 2.13 will be illustrated in the exercises. 


EXERCISES 


1. Show that if G, # 1, a € J, and |Z| > 1, then the center of [|*G, is 1. 
2. Show that the free group F'G(X) has no elements of finite order. 


3. Let []*G, be as in exercise 1. Determine the elements of finite order in [ [*G,. 


The next two exercises are designed to prove that the group of transformations of the form z 
az+b 


+» cz+d,a, b, c, d € Z, ad — bc = 1, of the complex plane plus oo is a free product of a cyclic 
group of order two and a cyclic group of order three. We note a fact, easily verified, that this 
group is isomorphic to SL,(Z)/{ 1, -1} where SL,(Z) is the group of 2 x 2 integral matrices of 


0 © 0 
determinant 1 and 1 = \® !1/,-1= 0-1) 


11 1 0 
4. Show that SL,(Z) is generated by the matrices (o 1) and ( 1) Let 


=r z 

Stevo) MZSS, Verify that S? = 1, 7 = 1, ST:z wz + 1, ST? :Z »z+1. Hence 
az+b 

prove that the group G of transformations z +ez+d, a, b, c, d € Z, ad— bc = 1 of EY {%0} 

is generated by S and 7. 


5. Show that all products of the form 


(ST)(ST?)8(ST)8 = or (SF2)E(STA(ST2)8 
az +o 
with k; > 0 can be written as z Z ez+d where a,b,c,d = 0 and at most one of these is 0. 
Hence show that no such product is 1 or S. Use this to prove that G is a free product of a 
cyclic group of order two and a cyclic group of order three. 
6. Let Z{X} be the free ring generated by X = {x}, x,... .x,,} as in exercise 6, p. 86, and let 
B be the ideal in 2{X) generated by the elements x”, x7’,...,x’,,. Put 4 = Z {X}/B. Call a 


2 


word in the x’s standard if no x7 occurs in it. Show that the cosets of 1 and of the 


standard words form a Z-base for A. Identify 1 and the standard words with their cosets. 
Then A has a Z-base consisting of 1 and the standard words with the obvious 


multiplication, which results in either 1, a standard word, or 0 (e.g., x7 = 0). Put y; = 1 + 
x; 1<i<n. Note that y; has the inverse 1 — x; so the y, generate a group under the 
multiplication in 4. Show that this is the free group FG(Y), Y= {y1, yo, -<-s Ynt- 


7. Let FG(Y) be as in exercise 6 and let FG(Y);, k = 1, 2, 3,..., be the subset of FG(Y) of a 
= 1(mod(X)‘) where X is the ideal in A generated by x)...,x,. Show that FG(Y), is a 
normal subgroup of FG(Y) and EP _ \FG(Y), = 1. If G is any group, define G” 


inductively by G = G and GO is the subgroup generated by the commutators xyx y~? 


where xeG") and yeG. One has G® > G2) > GO) > ... and this is called the Jower 


central series for G. Prove that F(Y) c F(Y). Hence prove Magnus’ theorem: N 
FG(Y) = 1. 


8. Let B, A,, a E I, be algebras ina variety V(S), and f, a homomorphism of B > 4,, a € I. 
Show that there exists an amalgamated sum (or pushout) of the f, in the following 
sense: an algebra A € V(S) and homomorphisms i, : 4, —> 4, a € I, such that i, fa = ifs 
for all a, J, and if Ce (S) and g,: A, > C satisfies g,f, = pf for all a, p, then there 
is a unique homomorphism g : A — C such that gi, = g,, a € I. 


9. Let {G,|a © I} be a set of groups all containing the same subgroup H. Construct a group 
G with the following properties: (i) G contains every G, as a subgroup so that G, N G5 
= Hifa + P. (ii) If X, is a set of representatives of the right cosets of H in G,, a € J, and 


X= Ux, then any element of G has a unique representation as a product hx x) ... x, 
where h € H, x; € X and if x; € X, then x,,,; £ X,, 1 <i<n-— 1. (We allow n = 0, in 
which case it is understood that the element is in H.) G is called the free product of the 
G with amalgamated subgroup H and is denoted as | [* 4G, 


2.10 INTERNAL CHARACTERIZATION OF VARIETIES 


Let C = V(S) be a variety of Q-algebras defined by a set of identities S. Here S is a subset of 
F(Q, Xo) x F(Q, Xo) where Xo = {X1, Xo, X3,...} and F(Q, Xo) is the free Q-algebra determined 


by Xo. We have seen that C has the following closure properties: 


1. IfA e C, then every subalgebra of A is in C. 
2. If A e C, then every homomorphic image of A is in C. 
3. If {A,|ael} DC, then [ JA, € C. 


Our principal goal in this section is to prove the converse: If C is any class of Q-algebras 
satisfying 1, 2, and 3, then C is a variety. 

As a preliminary to the proof we consider the identities satisfied by a given Q-algebra A. 
For our purposes, we need to consider identities in any set of elements and not just the 
standard set Xo. Accordingly, let X be any non- vacuous set, and F(Q, X) the free Q-algebra 
determined by X. If rj is a homomorphism of F(Q, X) into A, then the kernel kery is a 
congruence on F(Q, X) and F(Q, X)/kery is isomorphic to a subalgebra of A. Now put [d(X, A) 
= Okern where the intersection is taken over all the homomorphisms 7 of F(Q, X) into A. Then 
Id(X, A) is the set of elements (w,, w2), we F(Q, X), such that n(w,) = n(w2) for every 
homomorphism 7 of F(Q, X) into A. Thus /d(X, A) is the subset of F(Q, X x F(Q, X) of 
identities in the set X satisfied by the algebra A. Evidently, [d(X, A) is a congruence on F(Q, 
X). We call this the congruence of identities in X of the algebra A. We note that F(Q, X)/Id(X, 


A) is a subdirect product of the algebras F(Q, X)/kery (exercise 4, p. 70); hence F(Q, X)/Id(X, 
A) is a subdirect product of subalgebras of A. 

Next let C be any class of Q-algebras. We wish to consider the set of identities in X 
satisfied by every A € C. Evidently this set is IDX, C) = Myecld(X% A). Id(X% ©) is a 
congruence that we shall call the congruence of identities in X for the class C. It is clear that 
F(Q, X)/Id(X, C) is a subdirect product of algebras that are subalgebras of algebras in the 
class C. 

Now suppose C has the closure properties 1, 2, and 3 above. Then it is clear that any 
subdirect product of algebras in the class C is an algebra in C. It follows that F(Q, X)/Id{X, C) 
e C. Now let S be the congruence of identities in Xo for the class C: S = Id(Xo, C), Xo the 


standard set, and let K(S) be the variety defined by 5. We shall show that C = V(S). This will 
be an immediate consequence of 


THEOREM 2.14. IfX is infinite, then F(Q, X)/Id(X, C) together with the canonical map x 
> Xa x œ constitutes a free algebra for V(S) determined by X. 


Proof. We recall the construction given in section 2.8 for a free algebra Fs), X) for the 
variety V(S) and the set X: Fus), X) = F(Q, X)/®(S) where O(S) is the congruence on F(Q, 
X) generated by all pairs (ø(w1), p(w>)) where (w, w2) E S and g is a homomorphism of F(Q, 
Xo) into F(Q, X). Our result will follow if we can show that B(S) = Id(X, C). We have seen 
that the closure properties of C imply that F(Q, X)/Id(X¥%,C) e C. Hence we have the 
homomorphism of F(Q, X)/®(S) into F(Q, X/Id(¥, C) sending *(gg) into ux o 
Consequently, ®(S) c Id(X, C). Now let (z1, z2) € Id(X, C). Then z, and z, are contained in a 
subalgebra F(Q, X’) of F(Q, X) generated by a countable subset X' of X. We have maps ¢:X, —> 
X, A:X — Xo such that (Xp) = X"AC = 1 Xy These have unique extensions to homomorphisms Ç: 
F(Q, X) > FQ, X), 1: F(Q, X) > F(Q, Xp) such that (F(Q, Xp) = F(Q, Xp) and 2= 1 ro 
x: Choose w; € F(Q, Xo) so that Š (w) =z, i= 1, 2. Let fbe a homomorphism of F(Q, Xp) into 
an A € C. Then f= flava, x) = pe =f where f = fA is a homomorphism of F(Q, X) into A. 
Then f (z,) =F (z2) since (z4, z2) € Id(X, C) and hence fiw) = f (wp =F ep = F(z») = Kw). 
f Thu f(w,) = fíw2) for every homomorphism f of F(Q, Xp) into an algebra A € C. This means 
that (w;, w») € S = Id(Xp, C). Then (z;, z2) = (w1), §(wy)) € @(S); hence, Id(X, C) c ®(S) 
and so Id(X, C) = ®(S). D 


We can now prove the main result. 


THEOREM 2.15 (Birkhoff). A class C of Q-algebras is a variety if and only if it has the 
closure properties 1, 2, and 3 listed above. 


Proof. Itis clear that C c V(S) where S is the set of identities in Xo satisfied by every A e C. 


On the other hand, if 4 € (S), A is a homomorphic image of an algebra F(Q, X)/Id(X,C) for 
some infinite set X. Since F(Q, X)/Id(X, C) € C, it follows that A e C. Hence V(S) c C and so 
S) = cC. 0 
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Modules 


In this chapter we resume the study of modules which we initiated in BAI, Chapter 3. Our 
earlier discussion focused on the structure theory of finitely generated modules over a 
principal ideal domain and its applications to the structure theory of finitely generated abelian 
groups and the theory of a single linear transformation in a finite-dimensional vector space. 
The present chapter does not have such a singleness of purpose or immediacy of objective. It is 
devoted to the study of modules for their own sake and with a view of applications that occur 
later. 

The theory of modules is of central importance in several areas of algebra, notably structure 
theory of rings, representation theory of rings and of groups, and homological algebra. These 
will be developed in the three chapters that follow. 

We shall begin our study by noting the special features of the categories R-mod and mod-R 
of left and right modules respectively over the ring R. The most important of these is that for 
any pair of modules (M, N) the hom set homp,(M, N) is an abelian group. This has led to the 


definition of an abelian category. A substantial part of the theory of modules can be absorbed 
into the more general theory of abelian categories. However, this is not the case for some 
important parts of module theory. For this reason and for the added gain of concreteness, we 
shall stick to the case of modules in this chapter. 

The topics we shall consider are basic properties of categories R-mod and mod-R (section 
1), structure theory of modules (sections 2—5), tensor products of modules and bimodules 
(sections 6—8), projective and injective modules (sections 9-10), and Morita theory (sections 
11-14). The concepts of algebras and coalgebras over commutative rings will be defined by 
means of tensor products. In this connection, we shall single out some important examples: 
tensor algebras, symmetric algebras, and exterior algebras. As an application of the Morita 
theory, we shall give a proof of the Wedderburn-Artin structure theorem on simple rings. We 
shall not attempt to spell out in greater detail the topics that will be discussed. The section 
titles will perhaps be sufficiently indicative of these. 


3.1 THE CATEGORIES R-mod AND mod-R 


We begin our systematic study of modules by adopting the category point of view, that is, we 
shall consider first the special features of the categories R-mod and mod-R of left and right 
modules for a given ring R. We have seen that we may pass freely from left to right modules 
and vice versa by changing R to its opposite ring R. Hence, until we have to consider 
relations between left and right modules for the same ring R, we may confine our attention 
either to left or to right modules. Since there is a slight notational advantage in having the ring 
R and .R-homomorphisms act on opposite sides, we shall give preference to right modules and 
to the category mod-R in this chapter. 

Perhaps the most important fact about mod-R (which we have hitherto ignored) is that the 
hom sets for this category have a natural abelian group structure. If M and N € ob mod-R, then 
there is a natural way of introducing a group structure in the set homp (M, N). If f and g e hom 


(M, N) (= home (M, N), then we define f+ g by 
(ft+g)x=fx+gx,  xeM. 


Here we have abbreviated f(x) to fx, etc. We define the map 0 from M to N by 0x = 0 and — f by 
(—f)x =—fx, x € M. Direct verification shows that f+ g, 0, and — f e hom(M, N), and (hom 
(M, N), +’ 0) is an abelian group (BAI, pp. 168 — 169). 

We observe next that the product of morphisms in our category is distributive on both sides 
with respect to addition. Let P be a third module. Then for f, g € hom (M, N) and A, k € hom 
(N, P) we have 


h(f+g)=hf+hg,  (h+k)f= hf+kf 


In particular, if M = N = P, then we have the addition + and the multiplication, the composite of 
maps, in End M = hom (M, M). Moreover, the identity l} e End M and this acts as unit with 
respect to multiplication. Thus (End M, +, 1, 0, 1) is a ring, the ring of endomorphisms of the 
module M. 

An important module concept (which originated in algebraic topology) is that of an exact 
sequence of modules and homomorphisms. To begin with, we call a diagram of module 
homomorphisms 


t) q 
M-N-P 
exact if im f( =f (M)) = ker g, that is, gy = 0 for y e N if and only if there exists an x € M 
such that fx = y. More generally, a sequence of modules and homomorphisms 


Si J 
iiir: M; =p M, = M, MEUS 


that may be finite or run to infinity in either direction is called exact if for any three 


consecutive terms the subsequence M; — M; + , — M; > is exact. The exactness of 


J 
() — M ais N 


means that ker f= 0, which is equivalent to: f is injective. It is customary to write M 4 N for 
“0 — M & Nis exact.” Similarly “M 4 N— 0 is exact” is equivalent to: fis surjective. This 
can also be indicated by M +N. 

An exact sequence of the form 


J g 
0> M'>M -> M" >90 


is called a short exact sequence. This means that f is a monomorphism, g is an epimorphism, 
and ker g = im f. A special case of this is obtained by taking a submodule N of a module M and 
the quotient M/N. Then we have the injection i of N into M and the canonical homomorphism v 
of M into M/N. The first is injective, the second surjective, and ker v = N = im i. Hence we 
have the short exact sequence 


0>Nċ M >MJ/N=>0. 


Iff: M — N, we define the cokernel, coker f, as N/im f. Evidently, fis surjective if and only 
if coker f= 0. In any case we have 


kerf MSN >» coker f, 


that is, 0 > ker f + M 4 N — coker f— 0 is exact. 
We have seen in Chapter 1, pp. 35 — 36, that products and coproducts exist for arbitrary 
indexed sets of modules. If {M, |a € /} is an indexed set of right R-modules, then the product 


in mod-R is {IIM,, py} where p, is the projection homomorphism of IIM, onto M,. Moreover, 
if ® M, is the submodule of IIM, of elements (x) having only a finite number of the x, # 0, and 
if i, is the homomorphism of M, into ® M, sending the element x, e M, into the element of ® 
M, whose value at a is x, and whose remaining components are 0, then {@ M,, ip} = U M, the 
coproduct of the M. 

Now let J be finite: 7 = {1, 2,..., n}. Then clearly M=UiM:= iM: Hence this module and 
the maps i},...,7,, constitute a coproduct of the M, and M and the maps p),..., p, constitute a 
product of the M;. It is immediate from the definition of the i; and the p; that we have the 
following relations on these homomorphisms: 


(1) Pii = lm, Pi; = 9 if J#k 
i Py HiP te +i,P, = Lu: 


An important observation is that these conditions on a set of homomorphisms characterize the 
module M up to isomorphism. Precisely, suppose we have a module M' and homomorphisms p 


pola, and i'; : M; > M',1 <j <n, such that 


Pi=lu, pip =O if j#k 


n 


X ip = lw- 


1 


(2) 


Then 


n 


n 
(3) = Lip, O=)ip, 
1 1 
are homomorphisms M' — M and M — M' respectively. By (1) and (2) we have 0'9 = Im, 00" 
= ] w so 0 and 6’ are isomorphisms and 6’ = @ !. Moreover, Gi; = i';: M; > M'. It follows from 
the definition of a coproduct that M' and the i’; constitute a coproduct of the M;. Similarly, p;0' 
= p';, and M' and the p'; constitute a product of the M, in mod-R. 


In dealing with functors between categories of modules (for possibly different rings) we are 
primarily interested in the functors that are additive in the sense that for any pair of modules 
(M, N) the map f~ F (f) of hom (M, N) into hom (FM, FN) is a group homomorphism. From 
now on, unless the contrary is stated explicitly, functors between categories of modules will 
always be assumed to be additive. 

The foregoing characterization of the product and coproduct of M,, M,,..., M, by the 
relations (1) on the i; and p, implies that any functor F from a category mod-R to a category 
mod-S respects finite products and coproducts. Let M and the i; and p, be as before. Then if we 
apply F to the relations (1) and use the multiplicative and additive properties of F, we obtain 


F(p)F(i;) = lru, Fp Fi) = F(0)=0,  j#k, 
DF (i)F(D)) = lew 


Hence {FM, F(i;)} = UFM,, and {FM, F(p,)} = TLFM,. 

A functor F from mod-R to mod-S is called exact if it maps short exact sequences into short 
exact sequences; that is, if 0 — M' — 4 M 5 M" — Q is exact in mod-R, then 
0+ FM“ FM “S FM”—0 is exact in mod-S. This happens rarely. More common are functors 
that are left exact or right exact, where the former is defined by the condition that if 


rus) 


0+M4M+*M" is exact, then 0—FM'*2 FM*® FM" is exact and the latter means that if 


ru Fia) 


M'ŻMŠM"”=0 is exact, then rw “trm“srm"—0 is exact. Similar definitions apply to 
contravariant functors between categories of modules. These are assumed to be additive in the 
sense that for every (M, N) the map of hom (M, N) into hom (FN, FM) is a group 
homomorphism. Then F is left exact if the exactness of w4M@%M"+0 implies that of 


Ful) 


0+ FM’ FM“ FM’ is right exact if the exactness of 0 > M' —> 4 M & M" implies that of 


FM" &, pm FM'—=0 


We have defined the covariant and contravariant hom functors from an arbitrary category C 
to Set (p. 38). If C = mod-R, it is more natural to regard these as functors to mod-Z. We recall 
that if A € obC, then the (covariant) hom functor hom {4,—) from C to Set is defined as the map 
B ~ hom (A, B) on the objects, and hom (A, —) (f) for f : B — B' is the map of hom (4, B) into 
hom (4, B^, which multiplies the elements of the former on the left by f to obtain the 
corresponding elements in the latter. Now let C = mod-R and let M, N € ob mod-R. Then hom 
(M,N) is an abelian group, hence a 2-module and if f: N — N' and g, h € hom (M, N), then f(g 
+ h) = fg + fh. Thus hom (M, —)(f) is a homomorphism of the 2-module hom (M, N) into the Z- 
module hom(M,N'). Moreover, iff is a second element of hom(N,N’), then (f+ /)g = fg + fg so 
hom (M, —)(f+ f) = hom (M, —)(f) + hom (M, —)(f’). Thus hom (M, —) may be regarded as an 
(additive) functor from mod-R to mod-Z. This will be our point of view from now on. 

A basic property of the hom functors is left exactness: 


THEOREM 3.1. The hom functor hom (M, —) from mod-R to mod-Z is left exact. 


Proof. To prove this we have to show thatif0 — N' 4 N 5 N" is exact, then 


hom(M,-)(f) hom (M, — )(g) 


(44) O-—-hom(M,N’') hom (M,N) hom (M, N”) 


is exact. We are given that f is a monomorphism and im f= ker g and we have to show that the 
same thing holds for hom (M, —)(f) and hom (M, — )(g). Suppose g € hom (M,N') and (hom (M, 
—)(f))(~) = fọ =0. Then ifx € M, fox = 0 and since f is injective, this implies that gx = 0. Thus 
hom (M, —) (f) (g) = 0 implies that g = 0. Since hom (M, —)(f) is a group homomorphism, this 
implies that this is a monomorphism. Next we note that gfọ = 0 since gf = 0, so gfg(x) = 0 for 
all x€ M. Thus hom (M, —)(g) hom (M, —)(f)(g) = 0 for all g : M — N’, so we have hom (M, 
—)(g) hom (M, —)(f) = 0. Then 


imhom(M, —)(f) c kerhom(M, —)(g). 


To prove the reverse inclusion let w € ker hom (M, —) (g), so y € hom (M, N) and gy = 0 or 
gyx for all x e M. Thenifx € M, gyx = 0 shows that yx, which is an element of N, is in ker g. 
Then the hypothesis implies that there is a y e N’ such that fy = yx. Moreover, since f is a 
monomorphism, y is uniquely determined. We now have a map g: x + y of M into N’. It follows 
directly from the definition that g € hom (M, N’) and fo = w. Thus ker hom (M, — )(g) c im 
hom (M, —)(f) and hence we have the equality of these two sets. This completes the proof. o 

In a similar manner, the contravariant hom functor hom (—, M) determined by M € ob mod- 
R can be regarded as a contravariant functor from mod-R to mod-Z. Here hom (—, M)N = hom 
(N, M) and if fN — N’, then hom ( —,M)(f) is the right multiplication of the elements of hom (N 
", M) byf to yield elements of hom (N, M). We have 


THEOREM 3.1’. The contravariant hom functor hom ( —, M) is left exact. 


We leave the proof to the reader. 


EXERCISES 


1. Regard R as right R-module in the usual way (the action of R on R is the right 
multiplication as given in the ring R). Show that the right module M and homp (R, M) are 
isomorphic as abelian groups. More precisely, verify that for any x e M , the map u : a 
~ xa,aeé R, is inhomp (R, M) and x ~ u, is an isomorphism of M as abelian group 
onto homp (R, M). Use this to prove that the hom functor hom(R, —) from mod-R to mod- 


2 and the forgetful functor from mod-R to mod-2, which maps a module into its 
underlying abelian group, etc. are naturally isomorphic. 


2. Let M=2, N=2/(m), m> 1, and let v be the canonical homomorphism of M onto N. 
Show that 1 y cannot be written as vf for any homomorphism f : N — M. Hence show that 


the image of the exact sequence M — N — 0 under hom (N, —) is not exact. 


3. Let M =Z, N= mē, m> 1, and let i be the injection of N into M. Show that the 
homomorphism 1 / mx ~ x of N into M cannot be written as fi for any f : M — M. Hence 
show that the image of the exact sequence 0 —> N — M under hom (-, M) is not exact. 


4. (The “short five lemma.”) Assume that 


M' M g M" 
0 f 0 


0 0 
Nn’ h N k N” 
has exact rows and is commutative. Show that if any two of u, v, and w are 
isomorphisms, then so is the third. 


3.2 ARTINIAN AND NOETHERIAN MODULES 


After the aerial view of module theory that we experienced in the previous section, we shall 
now come down to earth to study modules as individuals. We shall begin with some aspects of 
the structure theory of modules. Categorical ideas will play only a minor role in the next six 
sections; they will come to the fore again in section 3.9 and thereafter. 

We shall first collect some tool results on modules, which are special cases of results we 
proved for Q-algebras (pp. 63—65), namely, the correspondence between the set of 
subalgebras of a quotient of an Q-algebra with the set of saturated subalgebras of the algebra, 
and the two isomorphism theorems. To obtain the corresponding results for modules one can 
regard these as groups with operators, the set of operators being the given ring. The results 
then become the following results on modules: If M is a module and P is a submodule, we have 
a bijection of the set of submodules of the quotient M = M / P and the set of submodules of M 
containing P. If N is a submodule of M containing P, the corresponding submodule of M = M / 


Pis N=N/ P= {y+Plyeé N] andif N is a submodule of M , then we put N= {ye My +P € 
N}. This is a submodule of M containing P and N = N / P. This result can also be verified 
directly as in the case of groups (BAI, p. 64). 

We have the following special cases of the isomorphism theorems for Q-algebras. 


FIRST ISOMORPHISM THEOREM FOR MODULES. /f N, and N, are submodules of 
M, then 


(5) (Vitya+N. yy +N oN), yiEN; 


is an isomorphism of (N; + N3)/N, onto N,/(N; 0 N3). 


SECOND ISOMORPHISM THEOREM FOR MODULES. IfM is a module, N and P 
submodules such that N D P, then 


(6) (x+P)+N/P x+N, xeM 
is an isomorphism of (M / P)/(N/ P) onto M/N. 


These two results can also be established directly as with groups (BAI, pp. 64—65). In fact, 
the group results extend immediately to groups with operators and the case of modules is a 
special case of these. The fact that the results are valid for groups with operators will be 
needed in the next section. 

A module M is called noetherian (artinian) if it satisfies the 

Ascending (descending) chain condition. There exists no infinite properly ascending 
(descending) sequence of modules Mi Mz: Ms =) inM. 

Another way of putting this is that if we have an ascending sequence of submodules M, c 
M> <M; c..., then there exists an n such that M. = s+: =*, One has a similar formulation for 
the descending chain condition. It is easily seen that the foregoing condition is equivalent to the 


Maximum (minimum) condition. Every non-vacuous set of S of submodules of M contains 
a maximal (minimal) submodule in the set, that is, a module P € S such that if N € S and N > P 
(Nc P), then N= P. 


EXAMPLES 


1. The ring Z regarded as Z-module satisfies the ascending but not the descending chain 
condition. Recall that every ideal in Z is a principal ideal (BAI, p. 143) and the submodules of 
Z are ideals. If S/25* is an ascending sequence of ideals in Z, then J = UI, is an ideal. This 
is principal: J = (m) where m € I, so m e I, for some n. Then ^=.: =". (The same argument 
shows that the ascending chain condition holds for every commutative principal ideal domain 
(p.i.d.), D regarded as a D-module. This is proved in BAI, p. 147.) On the other hand, ifm € Z 
and m # 0, + 1, then (™ ? m°)? (m°) 2° so the descending chain condition fails in Z. 


2. Let P be the additive group of rationals whose denominators are powers of a fixed prime 


p:m/pX,me @,k=0, 1,2,.... We regard P as 2-module. We have the ascending chain of 
submodules 


Z Ç Z(1/p) F Z(1/p’) F ZUP) | °°. 


It is easily seen that the submodules of this chain are the only submodules of P containing Z. It 
follows that M = P / Z is artinian but not noetherian. On the other hand, since Z is a submodule 
of P, itis clear that P is neither artinian nor noetherian. 

3. Any finite abelian group is both artinian and noetherian as 2-module. 


It is clear that if M is artinian or noetherian, then so is every submodule of M. The same is 
true of every quotient M / N, hence of every homomorphic image, since the submodule 


correspondence P ~ P = P / N has the property that P; > P, if and only if P = P 2 


We shall now show that conversely if M contains a submodule N such that N and M / N are 
artinian (noetherian), then M is artinian (noetherian). For this we need the 


LEMMA. Let N, P,, P, be submodules of M such that 
(7) P,>P,, N+P,=N+P, NaP;=NoaP, 
Then P} = P}. 


Proof. Letz; € P} Then z; € N+P,=N+ Po, soz 5y tz,yE€ N, z, € P>. Then y =z, — z3 
e Pi, soye Pi N N=P, AN. Thus y € P, and so Z =y +z, € P). Hence P; c P, and P; = 
P, O 


THEOREM 3.2. Jf M contains a submodule N such that N and M / N are artinian 
(noetherian), then M is artinian (noetherian). 


Proof. Let Pi > P) > P} D ... be a descending chain of submodules of M. Then 


(NOP) >(NAP:)>(NAP3)>- is a descending chain of submodules of M. Hence there exists a k 
such that N N Pk =N N Py, ,=.... Also (N+ PIN D (N+ P )/N D ... is a descending chain 


of submodules of M / N. Hence there exists an / such that (N+ P) /N=(N+P)41)/N=.... 

Then N+ P;=N + P),,=.... Taking n = max (k, /) and applying the lemma, we conclude that 

P,,=P,,+1=.... The proof in the noetherian case is obtained by replacing > by c everywhere. 
E 


THEOREM 3.3. Let M =N + P where N and P are artinian (noetherian) submodules of M. 
Then M is artinian (noetherian). 


Proof. We have M/ N=(N+ P)/ N= P /(N N P), which is a homomorphic image of the 


artinian (noetherian) module P. Hence M / N and N are artinian (noetherian). Then M is 
artinian (noetherian) by Theorem 3.2. O 


A ring R is called right (left) noetherian or artinian if R as right (left) R-module is 
noetherian or artinian respectively. A module M is called finitely generated if it is generated 
by a finite number of elements. In the case of right modules this means that M contains elements 
Xp X25- --, Xp Such that M=x,R+x.R+...+x,R. 


THEOREM 3.4. IfR is right noetherian (artinian), then every finitely generated right R- 
module M is noetherian (artinian). 


Proof. We have M = xR + ... + X„R and the epimorphism a ~ x,a of R onto x,R, 1 <i < m. 
Hence x,R is noetherian (artinian). Hence M is noetherian (artinian) by Theorem 3.3 and 
indication. O 


If R is a division ring, the only left or right ideals of R are 0 and R. Hence R is both left and 
right artinian and noetherian. Hence we have the 


COROLLARY. Let M be a vector space over a division ring. Assume M is finitely 
generated. Then M is artinian and noetherian. 


EXERCISES 


1. Prove that M is noetherian if and only if every submodule of M is finitely generated. 


2. Show that if an endomorphism of a noetherian (artinian) module is an epimorphism 
(monomorphism), then it is an isomorphism. 


3. Let V be a vector space over a field F, T a linear transformation in V over F, and let 
F[A] be the polynomial ring over F in an indeterminate 2 Then V becomes an F[A]- 
module if we define 


(dg + ayA + a24? ++ yy A™)X = aox + 4;(Tx)+ a(T?x)+ +a (T"x) 


(BAI, p. 165). Let V have the countable base (x), x2, ...) and let T be the linear 
transformation such that 7x, =0, Tx; 1 =x; i= 1, 2, 3,.... Show that V as F[A]-module 
defined by T is artinian. Let T' be the linear transformation in V such that T' x; =x; 4 1} i = 
1, 2,.... Show that V as F[A]-module defined by 7’ is noetherian. 


3.3 SCHREIER REFINEMENT THEOREM. JORDAN-HOLDER 
THEOREM 


The results that we will derive next are important also for non-commutative groups. In order to 
encompass both the cases of groups and modules we consider groups with operators. We 
remark that the results we shall obtain can also be generalized to Q-algebras—we refer the 
reader to Cohn’s Universal Algebra, pp. 91—93, for these (see References at the end of 
Chapter 2). 

We now consider the class of groups having a fixed operator set A; that is, A is a fixed set of 
unary operator symbols: for each A € A and each G in our class, x + Ax is a map of G into 
itself. Moreover, we require that 


(8) A(xy) = (ax) (Ay). 


Evidently left modules are a special case of this in which G = M, a commutative group written 
additively, and A = R, a given ring. We observe that even in this case the shift from the module 
to the group with operator point of view amounts to a generalization, since it means that we 
drop the module axioms involving the ring structure. We shall refer to groups with the operator 
set A as A-groups. We also speak of A-subgroups and normal A-subgroups meaning subgroups 
and normal subgroups closed under the action of A, that is, if A is in the subgroup and À € A, 
then Ah is in the subgroup. We write GH or H=G to indicate that H is a normal A-subgroup of 
G. A homomorphism f of a A-group G into a A-group H is a group homomorphism of G into H 
such that 


(9) f(Ax) = Af(x) 


for every x € G,A € A. The image AG) is a A-subgroup of H and the kernel f (D) = {ke GRK) 
= 1} is a normal A-subgroup. If H «= G we define the A-factor group G/ H whose elements are 
the cosets xH = Hx with the usual group structure and with the action of A defined by A(xH) = 
(Ax)H, 2 € A. The fact that H is a A-subgroup assures that A(x) is well defined. Moreover, the 
defining property (8) for G/ H is an immediate consequence of this property for G. 

We recall also that if A and B are subsets of a group G, then AB = {abla € A, b e B). If K = 
G and H is a subgroup, then HK = KH since HK = U, e yhK =U, e „Kh = KH. This is a 


subgroup of G and is a A-subgroup if G is a A-group and H and K are A-subgroups. In this case 
(HK) = Hand we have the isomorphism of A-groups of H/ HM K onto HK / K given by 


(10) h(HAK)~+hK, heH. 


The proof for groups without operators carries over without change (BAI, p. 65). In a similar 
manner the other basic results such as those given in BAI, pp. 64-65, carry over. For 
convenience we collect here the results we shall need. 

We have the bijective map H ~ H/ K of the set of A-subgroups of G containing K « G with 
the set of A-subgroups of G / K. Moreover, H «= Gif and only if H/ K «= G/ K. In this case we 
have the isomorphism of (G / K) / (H/ K) with G/ H given by 


(11) (gK)H/K~>gH, geG. 


More generally, suppose we have an epimorphism f of the A-group G onto the A-group G’ and 
H is anormal A-subgroup of G containing the kernel K of f. Then we have the isomorphism of A 


-groups of G/ H onto G'/ f(H) = f(G) / f(A) given by 
(12) gH > f(g)f(H). 


The concept of A-group achieves more than just an amalgamation of the group and the 
module cases. It also provides a method for dealing simultaneously with several interesting 
classes of subgroups of a group. Given a group G we let A be the set of inner automorphisms of 
G. Then the A-subgroups of G are the normal subgroups and the study of G as A-group focuses 
on these, discarding the subgroups that are not normal. Similarly, if we take A to be all 
automorphisms of G, then the A-subgroups are the subgroups that are mapped into themselves 
by all automorphisms. These are called characteristic subgroups. If we take A to be all 
endomorphisms of G, we obtain the fully invariant subgroups, that is, the subgroups mapped 
into themselves by every endomorphism of G. 

We define a normal series for the A-group G to be a descending chain of A-subgroups 


(13) G = G >G, >G; > >G, = l, 


that is, every G; , ; is a A-subgroup normal in G; (although not necessarily normal in G). The 
corresponding sequence of A-factor groups 


(14) G,/G,, GG iss Gigi = G, 


is called the sequence of factors of the normal series (13). The integer s is called the length of 
(13). The normal series 


(15) G=H,>H,cH;>CH,,, =1 


is equivalent to (13) if t = s and there exists a permutation i ~7' of {1, 2,..., s} such that G; / G; 
+1 = H; / Hy , , (isomorphism as A-groups). The normal series (15) is a refinement of (13) if 
the sequence of G;, is a subsequence of the H;. We wish to prove the 


SCHREIER REFINEMENT THEOREM. Any two normal series for a A-group have 
equivalent refinements. 


The proof of the theorem will be based on 


ZASSENHAUS’ LEMMA. Let G, G'i = 1, 2, be A-subgroups of a group G such that G'; = 
G; Then 


(16) (6, AGG, < (Gp AGG; 
(17) (G, NG,)G, 2 (G, NG,)G; 


and 
(18) (Gi AG,)G; n (Gi ^a G,)G; 
(G,AG)G, (G1^G,)G; 


Proof. We shall prove (16), 


(19) (GA GaG a G3) = (G, a G5)(G, A G) < (G, AG) 
and 
(20) (G,;AG,)G, | GAG, G, AG, 


(Gi nG), (Gi 0G,(G,0G,) (6, AGG, AG) 
A diagram that will help visualize this is the following: 


(Gi N G2)G; 


G, N Gz (Gi N GG; 


(G, N GiGi N Gy 


where the factor groups represented by the heavy lines are the ones whose isomorphism we 
wish to establish. Once we have proved the indicated results, then, by symmetry, we shall have 
(17) and 


(Gi A G2)G3/(G1 A G2)G3 = (Gi A G2)/(G; A G3) (G1 A Ga), 


which together with (20) will give the desired isomorphism (18). 

Now consider the A-subgroups G) N G, and G’ of G,. Since G’ «= G, we have 
GG, = GG, ^G) G: AGa (G,G)G, ig a <A-subgroup, and %=(6,%6,)6;. Similarly, 
G, 0G, = G, ^G, Then we have (19). Now form the A-factor group (G: ^ G2)G1/G;3 and consider the 


homomorphism 
f:x~xG, 
of Gi N G, into (6: 2161/6; . This is an epimorphism and 
FG a G3) (G1 9 G,)) = (Gi A G4) (Gi A G,)G)/G; 
= (Gi ^ G4)G;/G}. 
Since (G1 ^ G(G1 ^G) is normal in G, N G, and f is an epimorphism, 
(Gi A G2)G1/G1 < (G1 A G2)G4/G4, 
Then (Gi  G2)G; < (G1 ^G2)G1, which is (16). Moreover, 
(Gi A G)/(G 0 G3) (Gi AG) = f (GA GV (Gi a G4) (G10 G,)) 
(G1 nGa)G, (Gi aG,)G, _ (Gin G)G 
G; Gi 7 (Gin G)G, 


which gives (20). O 
We can now give the 


Proof of the Schreier refinement theorem. Suppose the given normal series are (13) and (15). 
Put 


(21) Ci = (CiO Ay)Gj 4.1, Hi = (H; Gi) Ay +1. 

Then Ga = Gis Gissı = Gis Hn = Ay. Heyes = Hess, By Zassenhaus’ lemma, Gix+1 9 Giz Ayj+1 © Hu and 
ny ~ 

(22) Gin/ Gigs a Ay i/y, i+ a? 


We have the normal series 
(G=) G,> G3 ara Gi, 
> Gz > Gy, 3° Gy 


(23) 


>G,> Gy 2G, 5 Gera) (=1) 


and 


(G=) Hy > Hy, >"> Hy, 
> H, > Hy, 3° > Hy 
(24) 
3H; Sigs © DH, 5H, iq (=). 


The corresponding factors are 


Gi1/Gi2 612/6135- GiGi ss, 
Gy1/G 2, Gy2/Gy3,...5 Ga,/Ga44 E 


(25) 
CAN R 1G OOR r As 
Hol Head of PEOR; PN! ger 
Hy) Na P) RE; O! Ma 55, 
(26) 


Alyy /Ay2, Ayo/Ay3,.« +5 Ats/ As 4+ 1- 


The factor in the ith row and kth column of (25) is isomorphic as A-group to the one in the kth 
row and ith column of (26). Hence (23) and (24) are equivalent. Clearly (23) is a refinement of 
(13) and (24) is a refinement of (15), so the theorem is proved. oO 


A normal series (13) will be called proper if every inclusion is proper: &% # G+:. A normal 
series is a composition series if it is proper and it has no proper refinement. This means that 
we have & #1 and there is no G' such that % PF PCi where G' = G; and G,,, = C. 


Equivalently, G; / G;, 1s simple as A-group in the sense that it is # 1 and it contains no proper 


normal A-subgroup 1. A A-group need not have a composition series (for example, an infinite 
cyclic group has none). However, if it does then we have the 


JORDAN-HOLDER THEOREM. Any two composition series for a A-group are 
equivalent. 


Proof. The factors of a composition series are groups + 1 and the factors of a refinement of a 
composition series are the factors of the series augmented by factors = 1. Now if we have two 
composition series, they have equivalent refinements. The equivalence matches the factors = 1, 
so it matches those # 1, that is, those which are factors of the given composition series. Hence 
these are equivalent. O 


If we take A to be the inner automorphisms, then the terms of a composition series are 
normal subgroups. Such a series is called a chief series. The Jordan-Hélder theorem shows 
that any two of these are equivalent. The result is applicable also to characteristic series 
defined to be composition series obtained by taking A to be all automorphisms and to fully 
invariant series obtained by taking A to be all endomorphisms. 


All of this applies also to modules. In this case we have an important criterion for the 
existence of composition series. 


THEOREM 3.5. A module M #0 has a composition series if and only if it is both artinian 
and noetherian. 


Proof. Assume M has a composition series M = M, > M>% >M,+,=0 (necessarily M; = M; +1 


since the groups are abelian). Let 4:1 P? N2?° ZN, be submodules. Then the given composition 
series and the normal series M >N: >N:>%>N,>N,1=0 have equivalent refinements. It 
follows that t < s + 1. This implies that M is artinian and noetherian. Now assume M +0 has 
these properties. Consider the set of proper submodules of Mı = M. This has a maximal 


element M, since M is noetherian. If M, # 0, we can repeat the process and obtain a maximal 
proper submodule M}, of M,. Continuing this we obtain the descending sequence 
Mı #M,#%M,#°", which breaks off by the descending chain condition with M, , ; = 0. Then 
M=M,>M,>-~>M...=° is a composition series. O 


EXERCISES 


1. Let G be a group with operator set A. Show that G has a composition series if and only if 
G satisfies the following two conditions: (1) If G = G, > G, > ... then there exists ann 


such that G, = G, +1 =.... (11) If His a term of a normal series of G and H, CH, c... 
is an ascending sequence of normal subgroups of H, then there exists an such that H, = 
H, +1: 

2. Let G be a A-group having a composition series and let H « G. Show that there exists a 


compositin series in which one term is H. Hence show that H and G/ H have 
composition series. 


3. Let Gand H be as in exercise 2. Call the number of composition factors the length 1(G). 
Show that /(G) = (H) + 1(G/ H). Show also that if H; « G, i= 1, 2, then 


3.4 THE KRULL-SCHMIDT THEOREM 


The results we shall give in this section are valid for groups with operators and are of interest 
for these also. In fact, these results were obtained first for groups (without operators). 
However, the proofs are simpler in the module case. For this reason we shall confine our 
attention to modules in the text and indicate the extension of the theory to groups with operators 
in the exercises. 

We recall that if “= ©iM:, then we have the homomorphisms i;: M; —> M, P;: M — M; such 
that the relations (1) hold. Now put 


p) 2, = |D; 
(27) e;= iP; 
Then e, e End M (= homg(M, M)) and (1) gives the relations 


e =e, ee=0 if j#k 
(28) 
e te + +e, = I. 


We recall that an element e of a ring is called idempotent if e? = e. Two idempotents e and f are 
said to be orthogonal if ef = 0 = fe. Thus the e, are pair-wise orthogonal idempotents in End M 
and Łie;=1, Put Mj= i(M) = ipM) = eM), Thx € M, x= lx = Vex and ex € M'. Thus M = M', 
+ M'a +... + M'a. Moreover, if x, e M then x, = e;x for some x € M. Then ex; = efx; =e =a 
and e,x; = €,ex = 0 ifj #,. This implies that Mjo (Mi te +Mj-1+Mjs1+ +M) for every j. 


Conversely, suppose a module M contains submodules M,, M),..., M, satisfying the two 
conditions 


(29) M = M,+M,+°':+M, 


(30) M; (My+:+Mj_,4+Mjyyte+M,)=0,  1<j<n. 


Then we have the injection homomorphism i; of M; into M and if x € M , we have a unique 
way of writing x =x, +... +x,, x; € Mj, so we obtain a map P;: M — M; defined by p;x = x;. It 
is clear that the conditions (1) for the i; and P; are satisfied. Then, as before, we have an 
isomorphism of M onto ® M;. In view of this result, we shall say that a module M is an 
(internal) direct sum of the submodules M4, M,,..., M, if these satisfy (29) and (30). We shall 


write 


(31) M=M,®M26°''@M, 


to indicate this. The endomorphisms e, = iP; are called the projections determined by the 
direct decomposition (31). 

It is useful to express the conditions (29) and (30) in element form: Any element of M can 
be written in one and only one way in the form x; + x) +... + x,, x; E Mj. Using this, one sees 
easily that if M is a direct sum of submodules M; and each M, is a direct sum of submodules 
M; then M is a direct sum of the submodules M;;. There are other simple results of this sort 
that we shall use when needed (cf. BAI, pp. 176—177). 

A module M is decomposable if M = M, ® M, where M; # 0. Otherwise, M is called 


indecomposable. 


PROPOSITION 3.1. A module M + 0 is indecomposable if and only if End M contains no 
idempotent + 0, 1. 


Proof. IfM =M; ® ... ® M, where the M, are # 0 and n > 1, then the projections e, are 


idempotents and e; + 0. Ife, = 1, then! = Xie: gives e; = e;l = ee, = 0 for every l #7. Hence e; 
mp j j l l Ij j 


+ 1. Conversely, suppose End M contains an idempotent e + 0, 1. Put e; = e, e, = 1 — e. Then 


e? =(1—e)?=1-2e +e? =1-2e + e= 1 -e= e, and eje, = e(1 — e) = 0 = ene). Hence the e; 
are non-zero orthogonal idempotents. Then we have M = M, ® M, where M; =e;(M) +0. o 


Can every module be written as a direct sum of a finite number of indecomposable 
submodules? It is easy to see that the answer to this is no (for example, take M to be an infinite 
dimensional vector space). Suppose M is a direct sum of a finite number of indecomposable 
submodules. Are the components unique? Again, it is easy to see that the answer is no (take a 
vector space of finite dimensionality > 1). Are the indecomposable components determined up 
to isomorphism? It can be shown that this need not be the case either. However, there is a 
simple condition to assure this. We shall first give the condition and later show that it holds for 
modules that are both artinian and noetherian, or equivalently, have composition series. The 
condition is based on a ring concept that is of considerable importance in the theory of rings. 


DEFINITION 3.1. A ring R is called local if the set I of non-units of R is an ideal. 


If R is local, it contains no idempotents except 0 and 1. For, if e is an idempotent + 0, 1, 
then e is not a unit since e(1 — e) = 0. Also 1 — e is not a unit. Hence if R is local, then e and 1 
—e are contained in the ideal Z of nonunits. Then 1 e Z, which is absurd. It follows that if End 
M for a module M +£ 0 is local, then M is indecomposable. We shall now call a module M 
strongly indecomposable if M + 0 and End M is local. The main uniqueness theorem for direct 
decompositions into indecomposable modules, which we shall prove, is 


THEOREM 3.6. Let 
(32) M = M,@M,@:-OM, 


(33) N =N,@N,@-"-ON, 


where the M, are strongly indecomposable and the N; are indecomposable and suppose M = 
N. Then m =n and there is a permutation j ~ j' such that M; = N;, 1 <j <n. 


We shall first separate off a 


LEMMA. Let M and N be modules such that N is indecomposable and M + 0. Let f and g 
be homomorphisms f. M — N, g: N — M such that gf is an automorphism of M. Then f and g 
are isomorphisms. 

Proof. Let k be the inverse of gf so kgf = 1. Put l = kg: N — M. Then Jf = 1), and if we put e 
= fl: N — N, then e? = flfl = fl yl = fl = e. Since N is indecomposable, either e = 1 or e = 0 and 
the latter is ruled out since it would imply that 1), = 1,7 = IfIf = lef = 0, contrary to M +£ 0. 


Hence fl = e = 1y. Then fis an isomorphism and g =k !f-! is an isomorphism. o 


Proof of Theorem 3.6. We shall prove the theorem by induction on n. The case n = 1 is clear, 
so we may assume n > 1. Let e,,..., e,, be the projections determined by the decomposition (32) 


of M, and let f,..., f,, be those determined by the decomposition (33) of N. Let g be an 
isomorphism of M into N and put 


(34) h=fge, kj=eg 'f, 1<j<m. 


Then =tkjhy= Eeg “fae: = eg Ege, = eg 'lvger =e, Now the restrictions of e; and k;h; to M, 
map M, into M, so these may be regarded as endomorphisms e',, (4;h;)’ respectively of M}. 
Since M, = e,(M) and eł = e, e' = ly, Hence we have ly, = ).(4;h;)’. Since End M; is local, 
this implies that one of the (4;h,)' is a unit, therefore an automorphism of Mj. By reordering the 
N, we may assume j = 1, so (k;h;)' is an automorphism of M}. The restriction of h4 to M, may 
be regarded as a homomorphism h,’ of M1 into N}. Similarly, we obtain the homomorphism k’ 
of N; into M} by restricting kı to N, and we have k,'h,' = (k,/,)’ is an automorphism. Then, by 
the lemma, /,' = (f,ge))' : M, —> N,and k,' = (e£ tfi)! : N} —> M, are isomorphisms. 
Next we prove 


(35) M = g"1N,@(M,+""+M,). 
Let *€9 Ni O (M+ +M). Then ejx= 0 since x € M, +... + M, and x = g` ly, y © Nj. Then 
O=ex=eg y= eg ‘hy=ky=ky, Then y = 0 and x = 0; hence, 9 'NO(M2 + +M,)=0, Next let 


iar oe ee OA cee 
seg NıcM'=g NitMate"+Mu Then x, e€5x,..., €,x E€ M' and hence ex € M'. Thus 


M'> eg 'N, =e1g™'fiNy = kiN, = kiN; =M, Then M'=>M+1<j<n and so M' = M. We therefore have 
(35). 


We now observe that the isomorphism g of M onto N maps g` !N, onto N;. Hence this 
induces an isomorphism of M / g~ N; onto N / N}. By (35), M/g `N, = Ma+-+M,=M:9=®M, , 


~ 


Since N/Ni = N29 Na, we have an isomorphism of M28: M, onto N28 Ne, The theorem 
now follows by induction. O 


We shall now show that these results apply to modules that have composition series. 

If f is an endomorphism of a module M, we define M =N=) ang 705 Uno kerf”, 
Thus x € f” M if and only if for every n = 1, 2,... there exists a y, such that f"y„ =x and x € f 
© 0 if and only if f’x = 0 for some n = 1, 2, .... The P(M) form a descending chain 


(36) M>f(M)>f2(M)>°°: 
and since f”x = 0 implies that f” * !x = 0, we have the ascending chain 


(37) Ockerfckerf? cs. 


Since the terms of both chains are submodules stabilized by f, f° M and f “0 are submodules 
stabilized by f. We have the following important result. 


FITTING’S LEMMA. Let f be an endomorphism of a module M that is both artinian and 
noetherian (equivalently, M has a composition series). Then we have the Fitting 
decomposition 


(38) M=f"M@f *0. 


Moreover, the restriction off to f° M is an automorphism and the restriction of f to f- ° M 
is nilpotent. 


Proof. Since M is artinian, there is an integer s such that f(M) = f (M) = ... = f°M. Since 
M is noetherian, we have a ¢ such that ker f = ker ff *! =... = f °0. Let r = max(s, t), so f°M = 
f (M) and f” 0 = ker f. Let z € fM N f 0, so z = f'y for y e M. Then0=/'z=/"y andy € 
ker f” = ker f”. Hence z = fy = 0. Thus f°M N f °0 = 0. Now let x € M. Then fx € f(M) = 
F (M) so fx = fy, y e M. Then f(x -f (y)) = 0 and so z =x- f(y) € f” 0. Then x= fy +z 
and fy e f° M. Thus M = f°M + f “0 and we have the decomposition (38). Since f ”0 = 
kerf”, the restriction of f to f ”0 is nilpotent. The restriction of f to fM =f'(M) =f'*'(M) is 
surjective. Also it is injective, since /"M naf“ =0=0 implies that °M N ker f=0. O 


An immediate consequence of the lemma is 


THEOREM 3.7. Let M be an indecomposable module satisfying both chain conditions. 
Then any endomorphism f of M is either nilpotent or an automorphism. Moreover, M is 
strongly indecomposable. 


Proof. By Fitting’s lemma we have either M = f°M or M =f “0. In the first case f is an 
automorphism and in the second, f is nilpotent. To prove M strongly indecomposable, that is, 
End M is local, we have to show that the set J of endomorphisms of M that are not 
automorphisms is an ideal. It suffices to show that if fe I and g is arbitrary, then fg and gfe l 
for any g € End M and if fi, fa € J, then fi + fa € I. The first one of these follows from the 
result that if fe l, then f is nilpotent. Hence fis neither surjective nor injective. Then fg and gf 
are not automorphisms for any endomorphism g. Now assume fi +/2¢!, so fi +f; is a unit in End 


M. Multiplying by (f, +4) | we obtain h, + h, = 1 where h;=f(/'+ PY ! € I. Then h; = 1 — h, 
is invertible since h,” = 0 for some n, and so (I~ hg) (I+ hgte thy") = b= (1 +h, + +h"™')(1— ha), 
This contradicts h € l. o 


We prove next the existence of direct decompositions into indecomposable modules for 
modules satisfying both chain conditions. 


THEOREM 3.8. Jf M # 0 is a module that is both artinian and noetherian, then M 
contains indecomposable submodules M,, 1 < i <n, such that M = M,®M:9 @M,, 


Proof. We define the length of M to be the number of composition factors in a composition 
series for M (exercise 3, p. 110). If Af is a proper submodule, then the normal series M ? N > 
0 has a refinement that is a composition series. This follows from the Schreier refinement 
theorem. It follows that the length of N < length of M. If M is indecomposable, the result holds. 
Otherwise, M = M, ® M, where the M, # 0. Then the length (M) < (M), so applying 
induction on length we may assume that Mı = Mi1.@Mj2@°"@M,,,, M2 = Mai OM22® Mm, where the 
M; are indecomposable. The element criterion for direct decomposition then implies that 
M= My, OMO OMn DMa O OM, 


The uniqueness up to isomorphism of the indecomposable components of M is an 
immediate consequence of Theorems 3.6 and 3.7. More precisely, we have the following 
theorem, which is generally called the 


KRULL-SCHMIDT THEOREM. Let M be a module that is both artinian and noetherian 
and let M = M,®M:® ƏM, = NıƏN2® Nn where the M; and N, are indecomposable. Then m = 
n and there is a permutation i ~ i' such that M;= N;, 1 <i<n. 


This theorem was first formulated for finite groups by J. H. M. Wedderburn in 1909. His 
proof contained a gap that was filled by R. Remak in 1911. The extension to abelian groups 
with operators, hence to modules, was given by W. Krull in 1925 and to arbitrary groups with 
operators by O. Schmidt in 1928. For a while the theorem was called the “Wedderburn- 
Remak-Krull-Schmidt theorem.” This was shortened to the “Krull-Schmidt theorem.” 


EXERCISES 


1. Let p € Z be a prime and let R be the subset of @ of rational numbers a / b with (p, b) = 
1. Show that this is a subring of @, which is local. 


2. Show that the ring Z, of p-adic numbers (p. 74) is local. 


3. Let A be an algebra over a field that is generated by a single nilpotent element (z” = 0). 
Then A is local. 


4. Let F be a field and R the set of triangular matrices of the form 


in M,(F). Show that R is a local ring. 


5. Let M and f be as in Fitting’s lemma and assume M = Mọ ® M} where the M; are 
submodules stabilized by f such that f | Mọ is nilpotent and f | M} is an automorphism. 
Show that Mọ =f “0 and M, =/"M. 


6. Use Theorem 3.6 to prove that if R® and R© are local rings and n, and n, are positive 


integers such that M.,(R") = M,,(R"), then n; = ny and RY = RO, 


7. Let M be a finite dimensional vector space over a field F, 7 a linear transformation in 
M, and introduce an FTA]-module structure of M via T, as in BAI, p. 165. Here å is an 
indeterminate. Use the Krull-Schmidt theorem to prove the uniqueness of the elementary 
divisors of T (see BAI, p. 193). 


8. Let the hypotheses be as in Theorem 3.6. Prove the following exchange property: For a 
suitable ordering of the N; we have N; = M; and 


M= g (N ,)® Dg (NJOM, +18 DM n 


l<k<n. 
The remainder of the exercises deal with the extension of the results of this section to A- 
groups (groups with an operator set A) and further results for these groups. 


9. Let Gand H be A-groups and let hom (G, H) denote the set of homomorphisms of G into 
H (as A-groups). Iff, g e hom (G, H), define f+ g by (f+ g)(x) =f(x)g(x). Give an 
example to show that this need not be a homomorphism. Show that if K is a third A-group 
and h,k e hom (H, K) then hfe hom(G, K) and h(f+ g) =hf+ hg, (h+k)f=hf + kf. 
Define the 0 map from G to H by x ~ 1. Note that this is a homomorphism. 


10. Let G be a A-group, G4, G,..., G„ normal A-subgroups such that 


(39) G=6,6,""G, 
(40) Gin Gi Gi-Gi G= l 


n 


for 1 <i <n. Show that if x; € G; x; € G; for i 4j, then xx; = x;x;, and every element of G 
can be written in one and only way in the form x; x7 ... X„, x; E G;. We say that G is an 
(internal) direct product of the G; if conditions (39) and (40) hold and indicate this by G 
= Gi x Gz X ... x G, Let e; denote the map x,x,...x, x; Verify that e; e€ End G = hom 


(G, G) and e; is normal in the sense that it commutes with every inner automorphism of G. 


n 


Verify that e? = e, ee, = 0 if i j, e; + e; = e; + e; for any i and j, and e; + e3 +... +e, = 
1. (Note that we have associativity of + so no parentheses are necessary.) Show that for 
distinct i»----% im {1,2,....mJ.e, +e, +7" +e END G. 


11. Call G indecomposable if G + G, x G, for any two normal A-subgroups # G, 1. Call G 
strongly indecomposable if G £ 1 and the following condition on End G holds: Iff and g 
are normal endomorphisms of G such that f+ g is an automorphism, then either for g is 
an automorphism. Show that this implies indecomposability. Use this concept to extend 
Theorem 3.6 to groups with operators. 


12. Extend Fitting’s lemma to normal endomorphisms of a A-group satisfying both chain 
conditions on normal A-subgroups. 


13. Extend the Krull-Schmidt theorem to A-groups satisfying the condition given in exercise 
12. 


14. Show that iff is a surjective normal endomorphism of G, then f= 1¢ + g where g(G) c 
C, the center of G. 


15. Prove that a finite group G has only one decomposition as a direct product of 
indecomposable normal subgroups if either one of the following conditions holds: (1) C 
= 1, (ii) @' = G for C the derived group (BAI, p. 245). 


3.5 COMPLETELY REDUCIBLE MODULES 


From the structural point of view the simplest type of module is an irreducible one. A module 
M is called irreducible if M + 0 and M contains no submodule N such that M ? N 29. This is a 
special case of the concept of a simple A-group that we encountered in discussing the Jordan- 
Holder theorem (p. 108). We have the following characterizations, assuming R 0. 


THEOREM 3.9. The following conditions on a module are equivalent: (1) M is 
irreducible, (2) M + 0 and M is generated by any x + 0 in M, (3) M = R / I where I is a 
maximal right ideal in R. 


Proof. (1) = (2). Let M be irreducible. Then M + 0 and if x # 0 in M, then the cyclic 
submodule xR + 0. Hence xR = M. Conversely, suppose M # 0 and M = xR for any x #0 in M. 
Let N be a submodule + 0 in M and let x e N, x 4 0. Then N > xR = M. Hence M is 
irreducible. 


(1) = (3). Observe first that M is cyclic (M = xR) if and only if M = R / I where / is a right 
ideal in R. If M = xR, then we have the surjective module homomorphism a ~ xa of R into M. 
The kernel is the annihilator / = ann x of x in R (BAL p. 163). This is a right ideal and M = R / 
I. Conversely, R / I is cyclic with generator 1 + J and if M = R / I, then M is cyclic. We 
observe next that the submodules of R / I have the form /' / J where I’ is a right ideal of R 
containing J. Hence R / I is irreducible if and only if / is a maximal right ideal in R. Now if M 
is irreducible then M is cyclic, so M = R / I and / is a maximal right ideal. The converse is 
clear. Hence (1) (3). O 


Perhaps the most important fact about irreducible modules is the following basic result. 


SCHUR’S LEMMA. Let M and N be irreducible modules. Then any homomorphism of M 
into N is either 0 or an isomorphism. Moreover, the ring of endomorphisms End M is a 
division ring. 


Proof. Let fbe a homomorphism of M into N. Then ker fis a submodule of M and im f is a 
submodule of N, so the irreducibility of M and N imply that ker f= M or 0 and imf = N or O, If 
f #0, ker f # M and im f # 0. Then ker f = 0 and im f = N, which means that f is an 
isomorphism. In the case N = M the result is that any endomorphism f + 0 is an automorphism. 
Then f ! € End M and this ring is a division ring. O 


The modules that we shall now consider constitute a direct generalization of irreducible 
modules and of vector spaces over a division ring. As we shall see, there are several 
equivalent ways of defining the more general class. Perhaps the most natural one is in terms of 
direct sums of arbitrary (not necessarily finite) sets of submodules. We proceed to define this 
concept. 


Let S = {M,} be a set of submodules of M. Then the submodule X} M, generated by the Ma 
is the set of sums 


Xa, HX H Xa Xa EL Mg. 

M:OEu, am, Ms =0_ Otherwise, S is 
dependent. If N is a submodule, then {N} is independent. If we look at the meaning of the 
definition of dependence in terms of elements, we see that a non-vacuous dependent set of 
modules contains finite dependent subsets. This property implies, via Zorn’s lemma, that if S is 
a non-vacuous set of submodules and T is an independent subset of S, then T can be imbedded 
in a maximal independent subset of S. If M = )M, and S = {M,} is independent, then we say 
that M is a (internal) direct sum of the submodules M, and we write M = ® M, This is a 


direct generalization of the concept for finite sets of submodules that we considered in section 


The set S is called independent if for every M, € S we have 


3.4. 
We shall require the following 


LEMMA 1. Let S= {Ma} be an independent set of submodules of M, N a submodule such 
that NN X Ma =Q. Then S U {N} is independent. 


Proof. If not, we have an x € Ma (€S) such that %57 +a + +a #0 where y € N, Xa, € 

M, E Sand Ma $ Map 1 sik. Ify = 0 we have x, =x, +... + Xa, contrary to the 

independence of the set S. Hence y + 0. Then = +> Xa =" = Xa EXM. contrary to N N X M, £ 0. 
O 


We can now give the following 


DEFINITION 3.2. A module M is completely reducible if M is a direct sum of irreducible 
submodules. 


Evidently any irreducible module is completely reducible. Now let M be a right vector 
space over a division ring A. It is clear that if x # 0 is in M, then xA is an irreducible 
submodule of M and M = >), + oxA so M is a sum of irreducible submodules. Now M is 


completely reducible. This will follow from the following 


LEMMA 2. Let M=} M, M, irreducible, and let N be a submodule of M. Then there exists 
a subset {Mz} of {M,} such that {N} U {Mp} is independent and M =N +N +}, Mg. 


Proof. Consider the set of subsets of the set of submodules of {M,} U {N} containing N and 
let {Ms} U {N} be a maximal independent subset among these. Put “= V*+2Me MSM, then 
there exists an M, such that M, # M’. Since M, is irreducible and M, N M' is a submodule of 
Ma, we must have Ma N M' = 0. Then, by Lemma 1, {Mz} U {N} U {M,} is independent 
contrary to the maximality of {M,} U {N}. Hence M = N + X Mg. O 


This lemma has two important consequences. The first is 


COROLLARY 1. Jf M is a sum of irreducible modules, say, M = YM, M, irreducible, 
then M = ® M; for a subset {Mz} of {M,}. 


This is obtained by taking N = 0 in Lemma 2. 

IfM is a right vector space over A, then M = } y + vA and every xA is irreducible. Hence M 
is completely reducible by Corollary 1. 

We have also 


COROLLARY 2. Jf M is completely reducible and N is a submodule, then there exists a 
submodule N' of M such thatM=N@N’. 


Proof. Let M = $} M, where the M are irreducible. Then, by Lemma 2, there exists a subset 
{Mp} of {M} such that M = > Mp + N and {Mz} U {N} is independent. If we put N’ = X} M5 
we have M=N@ N. DO 


The property stated in Corollary 2 is that the lattice L(M) of submodules of a completely 
reducible module M is complemented (BAI, p. 469). We shall show that this property 
characterizes completely reducible modules. We prove first 


LEMMA 3. Let M be a module such that the lattice L(M) of submodules of M is 
complemented. Then L(N) and L(M ) are complemented for any submodule N of M and any 
homomorphic image M of M. 


Proof. Let P be a submodule of N. Then M = P @ P' where P' is a submodule of M. Put P" = 
P' A N. Then N=NAM =NA(P+P)= P+P, by modularity of L(M) (BAI, p. 463). Hence N= P @ 
P". This proves the first statement. To prove the second we may assume M =M / P where P is 
a submodule of M. Then M = P @ P' where P' is a submodule and M = P'. Since L(P’) is 
complemented by the first result, L(M ) is complemented. O 


The key result for proving that L(M) complemented implies complete reducibility is 


LEMMA 4. Let M be a non-zero module such that L(M) is complemented. Then M contains 
irreducible submodules. 


Proof. Let x + 0 be in M and let {N} be the set of submodules of M such that x É N. This 
contains 0 and so it is not vacuous. Hence by Zorn’s lemma there exists a maximal element P in 
{N}. Evidently P # M, but every submodule P, ? P contains x. Hence if P, and P, are 


submodules such that P} ? P and P, ? P, then P; N P, ? P. It follows that the intersection of 


any two non-zero submodules of M / P is non-zero. Hence if P’ is a submodule of M such that 
M = P @ P' then P” = M / P has the property that the intersection of any two non-zero 
submodules of P’ is non-zero. Then P' is irreducible. For, P’ # 0 since M ? P and if P’; is a 


submodule of P’ such that P’; # P’, 0, then by Lemma 3, we have a submodule P’, of P’ such 
that P' = P’ ® P,'. Then P’ N P,' = 0, but P? 4 0, Py # 0 contrary to the property we 
established before for P'. O 


We are now in a position to establish the following characterizations of completely 
reducible modules. 


THEOREM 3.10. The following conditions in a module are equivalent: (1) M = YM, 
where the M, are irreducible, (2) M is completely reducible, (3) M + 0 and the lattice L(M) 
of submodules of M is complemented. 


Proof. The implication (1) = (2) has been proved in Corollary 1, and (2) = (3) has been 
proved in Corollary 2. Now assume condition (3). By Lemma 4, M contains irreducible 


submodules. Let {M,} be the set of these and put M' = $} M,. Then M = M' ® M" where M" is 


a submodule. By Lemma 3, L(M") is complemented. Hence if M" + 0, then M" contains one of 
the M, This contradicts M"^ M'= M"o2M,=0, Thus M" = 0 and M = M' = } M, Hence (3) = 


(1). o 


Let N be an irreducible submodule of the completely reducible module M. We define the 
homogeneous component Hy of M determined by N to be X N' where the sum is taken over all 


of the submodules MW’ = N. We shall show that M is a direct sum of the homogeneous 
components. In fact, we have the following stronger result. 


THEOREM 3.11. Let M= ® M,, where the M,, are irreducible and the indices are chosen 
so that Map M ag if and only if a = a'. Then H, = dig Map is a homogeneous component, M 
= @ H, and every homogeneous component coincides with one of the H,. 


Proof. Let N be an irreducible submodule of M. Then N is cyclic and hence 
Ne M'=M,@M.@~@M, where M, e {M,,}. If we apply to N the projections on the M; 
determined by the decomposition of M', we obtain homomorphisms of N into the M, By 


Schur’s lemma, these are either 0 or isomorphisms. It follows that the non-zero ones are 
isomorphisms to M; contained in a subset of {M,,}, all having the same first index. Then N c 


H, for some a. If N' is any submodule isomorphic to N, we must also have N’ c H,. Hence the 
homogeneous component Hy c H,. Since H, c Hy is clear, we have Hy = H,. Since N was 


arbitrary irreducible, we account for every homogeneous component in this way. Also the fact 
that M = ® H, is clear, since M = ® M,p and H, =} Mp O 


EXERCISES 


1. Let M be a finite dimensional (left) vector space over a field F. Show that M regarded 
as a right module for R = EndpM in the natural way is irreducible. Does this hold if F is 


replaced by a division ring? Does it hold if the condition of finiteness of dimensionality 
is dropped ? 

2. Let M be the vector space with base (e4, e2) over a field F and let R’ be the set of linear 
transformations a of M / F such that ae, = ae), ae, = fe, + ye, where a, p, y E F. Show 


that R’ is a ring of endomorphisms and that if M is regarded in the natural way as right R 
"module, then M is not irreducible. Show that Endp, M is the field consisting of the 


scalar multiplications x ~ ax. (This example shows that the converse of Schur’s lemma 
does not hold.) 


3. Exercise 2, p. 193 of BAI. 
4. Let V be the F[A]-module defined by a linear transformation T as in exercise 3, p. 103. 


Show that V is completely reducible if and only if the minimal polynomial m(A) of T is a 
product of distinct irreducible factors in FA]. 


5. Show that ® M, together with the injections W:M» ®™; constitute a coproduct of the Mg. 
6. Suppose {M} is a set of irreducible modules. Is MM, completely reducible? 


7. Show that a completely reducible module is artinian if and only if it is noetherian. 


8. Let M be completely reducible. Show that if His any homogeneous component of M, 
then H is an Endp M submodule under the natural action. 


3.6 ABSTRACT DEPENDENCE RELATIONS. 
INVARIANCE OF DIMENSIONALITY 


A well-known result for finite dimensional vector spaces is that any two bases have the same 
number of elements. We shall now prove an extensive generalization of this result, namely if M 
is a completely reducible module and = ©M,= Mi where the M, and M's are irreducible, 
then the cardinality KM3! = Mi), This will be proved by developing some general results on 
abstract dependence relations. The advantage of this approach is that the results we shall 
obtain will be applicable in a number of other situations of interest (e.g., algebraic dependence 
in fields). 

We consider a non-vacuous set X and a correspondence A from X to the power set # (X). We 
write x < S if (x, S) €e A. We shall call A a dependence relation in X if the following 
conditions are satisfied: 

1. Ifx e S, thenx < S. 

2. Ifx < S, then x < F for some finite subset F of S. 

3. Ifx < Sand every y € S satisfies y < T, then x < T. 

4. Ifx < Sbutx x S— {y} (complementary set of {y} in $), then y < (S— {y}) U {x}. 
This is called the Steinitz exchange axiom. 

The case of immediate concern is that in which X is the set of irreducible submodules of a 
completely reducible module M and we define M, < S for M, € X and S c X to mean that 
M.<YusesMs Property 1 is clear. Now suppose “+S EwsessMs and let x, #0 be in M,. Then 
%.€2a,<rMy for some finite subset F of S. Then M== XR = XM, , Hence 2 holds. Property 3 is clear. 
Now let "=S EwsesMs and Ma% Eu,eueMy, As before, let x, #0 in M, and write 


Xa = Xp HX to +X, 
where xg € Mg, Then M. © My, +My, ++ Min. We may assume that the xg, #0 and My # My ifi + 


j. Moreover, the condition M= * Em, + uM» implies that one of the M g, = Mp. We may assume i = 1. 
Then 0#% 5 X=3am Xm, which implies that Ms=%eR°Ms+My,+°"+My.. = Then 
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Ms < (M. M;|M, # My} This proves the exchange axiom. 

We now consider an arbitrary dependence relation on a set X. We call a subset S of X 
independent (relative to A) if no x € S is dependent on S — {x}. Otherwise, S is called a 
dependent set. We now prove the analogue of Lemma | of section 3.5. 


LEMMA 1. Jf Sis independent and x < S, then S U {x} is independent. 


Proof. We have x < S so x É S. Then if S U {x} is not independent, we have a y € S such that 
y< (S— {y}) U {x}. Since Sis independent, y < S— {y}. Hence by the exchange axiom, x < (S 
— {y}) U {y} = S contrary to hypothesis. Hence S U {x} is independent. O 


DEFINITION 3.3. A subset B of X is called a base for X relative to the dependence 
relation A if (i) B is independent, (ii) every x € X is dependent on B. (Note B = Ø is 
allowed.) 


THEOREM 3.12. There exists a base for X and any two bases have the same cardinal 
number. 


Proof. The “finiteness” property 2 of a dependence relation permits us to apply Zorn’s lemma 
to prove that X contains a maximal independent subset, for it implies that the union of any 
totally ordered set of independent subsets of X is independent. Then Zorn’s lemma is 
applicable to give the desired conclusion. Now let B be a maximal independent subset of X 
and let x e X. Then x < B; otherwise, Lemma | shows that B U {x} is independent contrary to 
the maximality of B. Hence every x € X is dependent on B, so B is a base. 


Now let B and C be bases. If B = Ø, it is clear that C= Ø so we assume B £ Ø, C £ Ø. 
Suppose first that B is finite, say, B = {x}, X>,..., X,}. We prove by induction on k that for 1 < k 
<n + 1 there exist y; € C such that (1----Yk-1%---»%d is a base. This holds for k = 1 since B is a 
base. Now assume it for some k. We claim that there is a y in C such that Y * Waves ¥e-a9 Xe eters Xn) 
. Otherwise, every y € C is dependent on P = Wie %e-1Xerr-e%e | Then x < D for every x € B 
contrary to the independence of \W--+++¥e-1»%---»*). Now choose y = y, so that y, * D. Then 
Wires ig independent, by Lemma 1. Moreover, since Yi < (ar-+sde-1%e----%a}, it follows 
from the exchange axiom that “*~ Wi Yr-1YrXr+1»---X}, Then all of the elements of the base 
[Pien are dependent on Vee Yws} and hence every x € X is dependent on 
Wire Xen Gd Then WievrtiXeene%d base. We have therefore proved that for every k we 
have a base of the form Vee YeX+sı=X} with the y’s in C. Taking k = n we obtain a base Wr 
-> Ynzo Yi E C. It follows that C= {y),..., Yny. Thus | C| =n =|B |. The same argument applies 
if | C | is finite, so it remains to consider the case in which both | B | and | C | are infinite. To 
prove the result in this case we shall use a counting argument that is due to H. Lowig. For each 
y € C we choose a finite subset F, of B such that y < F,. Then we have a map y ~ F,, which 
implies that | C | > | {F,} |. Since every F, is finite, this implies that SolC > IUF,I. Since | C | is 
infinite, we have SelCl= Ci, Thus | C | > | U F, |. Now UF, = B. Otherwise, every y € C is 


dependent on a proper subset B’ of B and since C is a base, every x € B is dependent on B’. 
This contradicts the independence of the set B. Hence | C | = | UF, | =| B |. By symmetry | B | > | 


C|. Hence |B|=|C|. O 


We remark that the first part of the preceding proof shows that any maximal independent 
subset of X is a base. The converse is evident. We mention also two other useful supplements 
to the result on existence of a base: 


(i) If S is a subset such that every element is dependent on S then S contains a base. 

(ii) If S is an independent subset then there exists a base containing S. 
We now apply Theorem 3.12 to completely reducible modules by taking X to be the set of 
irreducible submodules of a completely reducible module M and defining Me~< {Mg} if M, = =M,_ 
We have seen that this is a dependence relation. Moreover, independence of a set of 


irreducible modules in the sense of < is the independence we defined before for submodules of 
a module. A set B = {Mz} of irreducible modules is a base for X if it is independent and every 


irreducible module M, c }'Mz. Since M is a sum of irreducible submodules, this condition is 
equivalent to M = )'Mz. Hence B = {Mz} is a base if and only if M = ® )'M;,. The theorem on 
bases for a set X relative to a dependence relation now gives the following 


THEOREM 3.13. Let M = ® M= ® N, where the Mg and N, are irreducible. Then | {Mg} | 
=| {N,} |: 


We shall call | {Mp} | the dimensionality of the completely reducible module M. If M is a 
right vector space over A with base (x,), then M = @ x,A and every x,A is irreducible. Hence 
the dimensionality of M is the cardinality of (x$). Thus any two bases have the same 
cardinality. This is the usual “invariance of dimensionality theorem.” 

We can also apply Theorem 3.13 and the decomposition of a completely reducible module 
into homogeneous components to obtain a Krull-Schmidt theorem for completely reducible 
modules. 


THEOREM 3.14. Let M = ® M= ® N, where the Mg and N, are irreducible. Then we have 
a bijection between the sets Ma) {*) such that the corresponding modules are isomorphic. 


This is clear, since for each homogeneous component the cardinality of the set of Mz in this 
component is the same as that of the set of N, in this component. 


EXERCISE 


1. Show that if M is a completely reducible module that has a composition series, then the 
length of M as defined in exercise 3 (p. 110) coincides with the dimensionality. 


3.7 TENSOR PRODUCTS OF MODULES 


The concept of tensor product of modules, which we shall now introduce, is a generalization of 
an old concept for vector spaces that has played an important role for quite a long time in 
several branches of mathematics, notably, differential geometry and representation theory. In 
the case of vector spaces, this was originally defined in terms of bases. The first base-free 
definition seems to have been given by H. Whitney. This was subsequently improved to take 
the standard form that we shall present. Before giving this, we look at an example that is 
somewhat in the spirit of the older point of view. 


EXAMPLE 


Let R be a ring. We wish to consider simultaneously free left and free right modules for R. To 
distinguish these we use the notation R0 for the free right module with base of n elements and 
(R for the free left module having a base of n elements. For our purpose, it is convenient to 
write the elements of “R as rows: (Xi -> Xn), X; E R, and those of R®™ as columns: 


x, 


(41) x 


X 


n 


In the first case the module action is by left multiplication and in the second by right 
multiplication by elements of R. We now consider R™, ™R and Mm, »(R) the additive group of 
m x n matrices with entries in R. Ifx e R™ as in (41) and y = (y,..., y,) € “YR, then we can 
form the matrix product 


X1 XiVy XyVo Xi Vn 
XoVy X2V2 XVn 

(42) xy= ETR, B = 
Xm X mY 1 X my2 aa Xm n 


This product is distributive on both sides and satisfies an associativity condition (xa)y = x(ay), 
a €e R. Hence it is an instance of a balanced product in the sense that we shall now define for 
an arbitrary pair consisting of a right module and a left module. 


Let M be a right module, N a left module for the ring R. We indicate this situation by writing 


Mp and PN in the respective cases. We define a balanced product of M and N to be an abelian 


group P (written additively) together with a map f of the product set M x N into P satisfying the 
following conditions: 


Bl. f(xtxyy) =f (x,y) +f(xy). 
B2. f(x,yty) = f(x, y)+f (x, y’). 
B3. f(xa,y)= f(x,ay). 


Here x,x' € Mp, y, y' E€ pN, a € R. Note that these conditions imply as in the case of addition 
and multiplication in a ring that AO, y) = 0 = f(x, 0) and f- x, y)= — fx, y) = f(x, — y). We 
denote the balanced product by (P, f). If (Q, g) is a second one, then we define a morphism 
from (P, f) to (Q, g) to be a homomorphism y of the additive group P into the additive group Q 
such that for any x y e M x N we have 


(43) g(x,y) = nf (x,y). 
We can now introduce the following 


DEFINITION 3.4. A tensor product of Mp and pN is a balanced product (M®«N.®) such that 
if (P, f) is any balanced product of Mp and pN, then there exists a unique morphism of 


(M@eN.®) to (p, f). In other words, there is a unique homomorphism of the abelian group M @ 
RN into the abelian group P sending every x ®@y,x € M, y € N into f (x,y) € P. 


It is clear from the definition that if (M:N): ®9ı) and (M®kN)82) are tensor products, then 
there exists a unique isomorphism of (M ® RN); onto (M ® g N)>. such that *®@iv>*@.y. It is 
also clear that if (M ® RN, ®) is a tensor product, then the group M ® pN is generated by the 
products x ® y. In fact, since — (x ® y) = (-x)® y, it is clear that every element of M ® pN has 
the form Lx:@yi. eM, WEN, 

We proceed to construct a tensor product for a given pair (Mp, pN). For this purpose we 


begin with the free abelian group F having the product set M x N as base. The elements of F 
have the form 


(44) ni(Xi y1) + N9(X>, V2) y aaa. y nX, Vp) 


where the n; € Z, x; € M, y;E N. Addition is the obvious one, and if (x; y;) # @;, y;) for i +j, 
then (44) is 0 if and only if every n; = 0. Let G be the subgroup of F generated by all of the 
elements 


(x + x,y) — (x,y) — (x,y) 
(45) (x,y + y) — (x,y) — (x, y’) 


(xa, y) — (x, ay) 
where x, x'e M, y,y'€ N,a e R. Now define 
(46) MON = F/G, x@y=(x,y)+ GEeM@,N. 


We claim that (M ® pN, ®) is a tensor product of M and N. First, we have 
(x+x)Ay-xQy -x @y=((x+x,y) + G)—((x,y) + G) -—((x,y) + @ 
= ((x + x,y) — (x,y) — (x,y) + G 


=G6=0. 


Similarly, x @ (y +y) =x @y+x@y’' and xa @y =x @ ay. Hence (M ® pN, ®) is a balanced 
product. Now let (P, f) be any balanced product. Since F is the free abelian group with base M 
x N, we have a (unique) homomorphism of F into P sending (x, y) ~ f(x, y). Let K denote the 
kernel of this homomorphism. The conditions B1, B2, B3 imply that 


(x + x’, y) — (x,y) — (x,y) 
(x, y T y) a (x, y) a (x, y) 


(xa, y) — (x,ay) 


x,x' E M,y,y'€ N,aeé R, are in K. Then G c K and so we have a homomorphism of M @pN = 
F / G into P sending x ® y = (x, y) + G ~ f(x, y). This is unique, since the cosets x ® y generate 
M @pN. Hence we have verified that (M @pN, ®) is a tensor product of Mp and pN. 

We shall now assume that somehow we have made a determination of (M @pN, ®). We 
simplify the notation to M @ N or M @pN if we need to specify the ring R, and we speak of the 


tensor product of M and N. Now suppose we have module homomorphisms f. M — M' and g: 
N— N. Then we have the map 


(x,y) > XOY 


of M x N into M' ® N’, which satisfies the conditions for a balanced product of M and N since 


L(x+x)@gy = fx@gy +X @gy 
ARJ ty’) =fx@gy +/x@gy’ 
f (xa)®@gy = fx@glay). 


Hence we have a unique homomorphism of M ® N into M’ @ N' such that 


X@y > fx@gy. 
We denote this as f®@ g, so, by definition, 
(47) (89) (x®y) = fx@gy 


for all xe M, y € N. Suppose next that we have homomorphisms f : M' > M", g': NM > N". 
Then ff: M — M", g' g:N — N"' and 


(f'(g'9) (x@y) =f fx@g'gy = (f'@g') Agy) 
= (f'@g') (({@g) (x@y)) 
= ((f'@g') ({@g)) (x@y). 


Since the elements .x ® y generate M @ N, this implies that 


(48) (9) g) =f fg. 
Since (lu 81x8») = x87, we have 
(49) lu®ly = IMON: 


These results amount to saying that the maps 
(M,N)>M@N, (f9) > fg 
define a functor from the category mod-R x R-mod to the category Ab (or 2-mod). We shall 


denote this functor as ® p. 


Again let f: M — M' in mod-R and g : N > N' in R-mod. Then we have the commutativity 
of the diagram 


MON lyg 


SO ly p SO ly 


M'ON Og MON 


that is, we have 


(50) (f@1y)(1y®g) = fBg = (ly EISE 1x). 


This is an immediate consequence of (48). We also have that the following distributive laws 
(fi +/2)@g =f @gth@g 
fI U +92) = SOJ: +/@92 
for AM >M’, ¢:N—N' will follow directly by applying the two sides to x8 y,xe M,yeN. 
We shall now fix one of the arguments in the functor ® p and study the resulting functors of 


one variable. Let M be a fixed right R-module. Then we define the functor M ® p (or M ®) by 


specifying that for any left R-module N and any homomorphism g: N — N' of left R-modules, 
we have 


(51) 


(MORN = MQN, (MOr (g) = ly @g. 


We have | @¢'¢ = (1ng) (lu ®g) for g':N'=N" and Iy@ly=Iwev, Moreover, we have the additivity 
property since !w®(i +92) = lw@gi +1u®gz by (51), Hence M ® p is indeed a functor from R-mod to 


Z-mod in the sense that we have adopted in this chapter. In a similar manner, any left module 
defines a functor ® pN (or ® N) from mod-R to mod-Z by 


(@gN)M=M@gN, (@gN)(f) =f@ly 


for f. M— M' in mod-R. 
We now specialize M = R = Rp, R regarded as right R-module. Then we have 


PROPOSITION 3.2. N be a left R-module. Then the map 
(52) Ny: y> 1®y 


is an isomorphism of N regarded as a £-module with R ® pN. Moreover, N ~ ny is a natural 
isomorphism of the forgetful functor from R-mod to Z-mod (or Ab) with the functor R ® p. 


Proof. Evidently (52) is a Z-homomorphism. On the other hand, for r € R, y € N, f(r, y) = ry 
is a balanced product of R and N. Hence we have a Z -homomorphism Cy of R ® pN into N such 


that G(r ® y) = ry. It is clear that §*"* = 'y ad nxx = lren, Hence the first assertion is valid. The 
second assertion is an immediate consequence of the definitions. O 


In a similar manner, if we define "u:M = M@«R by x +x@!, we see that this is an isomorphism 
and M ~ 7’, is a natural isomorphism of the forgetful functor from mod-R to mod-Z with the 


functor ® pR. The reader should note that a similar result for the hom functor was given in 
exercise 1, p. 99. 


The general result we noted in section 3.1 (p. 98) that functors on categories of modules 
respect finite coproducts can be applied to the functors M ® p and ® pN. In this way we obtain 


canonical isomorphisms of M ® (® N,) with @(M ® N;) and (®@ M)® N with @(M,;, ® N). 
Combining these two we obtain an isomorphism of (®7MI@(@iN) with OP) Mi@N; , 

Proposition 3.2 shows that we have an isomorphism of R ® RR onto the additive group of R 
such that r ® s ~ rs. With this result and the isomorphism of (®MJ®(ƏN,) onto @(M\@N)) it is easy 
to see that the example given at the beginning of this section of the balanced product of R@” and 
"R into Mp, ,(R) is the tensor product. 

The result on the isomorphism of a finite coproduct @(M; ®© N) with (® M;)® N can be 
extended to arbitrary coproducts. Let {M, | a € J} be an indexed set of right R-modules. Then 
® M, is the set of functions (x,) : ax, E€ M, such that xg = 0 for all but a finite number of £, 
and if N is a left R-module, (® M, ® N is the set of sums of elements (x,)® y, y € N. Similarly, 
@(M, ® N) is the set of functions (z,), a z, E€ Ma ® N such that Zg = 0 for all but a finite 
number of p. Since every element of M, ® N is a sum of elements *.®» %.©M. YEN, it follows 
that every element of @(M, ® N) is a sum of elements (x, ® y) where this denotes the function 
a Xg y. We shall now prove 


PROPOSITION 3.3 Ifthe M, a € I, are right R-modules and N is a left R- module, then 
we have an isomorphism 


Nv:(OM,)@N > O(M,@N) 


such that 
(53) (x,)®@y (x,y). 
Proof. Let (x,) € My y € N. Then *:®))¢ O(M.@N) and ((x,)y) (9) defines a balanced product 


of ® M, and N into @(M, ® N). Hence we have a homomorphism of (®@ M,) @ N into @(M, ® 
N) such that (x,)® y (x, ® y). On the other hand, if i, is the canonical homomorphism of M 
into ® M, (sending xa into the function whose value at a is x, and whose value at every p # a 


is 0), we have the homomorphism i® tx of M,@N into (MJN . By the defining coproduct property 
of @(M, ® N) we obtain a hoomomorphism ‘w:®(4,@N) +(®M,)@N such that 


(x,®y) (x,)®y. 


Checking on generators, we see that yy and ¢y are inverses. Hence (53) is an isomorphism. O 


Now let N’ be a second left R-module and let f : N’ — N. We have the homomorphism 
lu.®f of M,@N'~M,@N which defines the homomorphism f* of ®(M@.@N)~ @(M,®N) such that 
(x.@y) y) for x, E Ma y' E N. We also have the homomorphism 18/ of (®M)®N' into (@ 


M,)2 N where 1 = 1@M,. Now we have the naturality of the isomorphism yy given in 


Proposition 3.3, that is, 


(@ M,) ® N’ Tv" ®@(M,® N’) 
(54) 197 p 
(OM,)®N Ny © (M, ® N) 


is commutative. This follows directly by checking on the generators (x,)®@ y’. 
We shall derive next the fundamental exactness property of the tensor functors. 


THEOREM 3.15. The functors M ® pand ® g N are right exact. 


Proof. Suppose N'N N". Since g is surjective, every element of M ® p N” has the form 
Lx, @gy, EM, WEN, Thus 1 ® g is anamorphism of M ® N onto M ® N". Hence to prove that 
MON ŽL MoN = MON", it remains to show that ker (1 ® g) = im (1 ® f). Since gf=0, we have 
(1 82) (1 8f) = 0 and im (18) < ker(18g). Hence we have a group homomorphism © of (M ® 
N)/ im (1 @f) into M ® N" such that *8y +im(18f)~x8gy. This is an isomorphism if and only if 
im (1 ®f)= ker (1 ® g). Thus it suffices to show that © is an isomorphism. Let x e M, y" € N" 
and choose a y € N such that gy = y". We claim that the coset *®@y+im(I@/) in (M@NYim(1@f) is 
independent of the choice of y. To see this, let y4, y> € N satisfy gy; = y" = gy2. Then g(y, — y2) 
=0 y -y =fy' for y' e N'. Then *@y: +im(1@f) = x@y2+im(1@/), We now have a map of M x N" 
into (M ® N) / im (1 ® f) such that %y") ~> x@y+im(1@/) where gy = y". It is clear that this is a 
balanced product. Hence we have a homomorphism 6’ of M @ N" into (M 8 N)/im(1 @ f) 
such that 0'(x ® y") =x @y + im(1 ® f) where gy = y". Checking on generators we see that 
OF = luov and 78 = lwonimion, Thus 9 is an isomorphism. This proves the first assertion. The 
second is obtained ina similar fashion. o 


EXERCISES 


1. Let R” be the opposite ring of R and regard a right (left) R-module as a left (right) R°- 
module in the usual way. Show that there is a group isomorphism of M ® pN onto N@eeM 


mapping x ® y into y @x,x E€ M, ye N. Note that if R is commutative, this applies to M 
® pNand N® pM. 


2. Show that if mis a positive integer, then Zem = z/(m (® = @.). Hence show that 
0+ mZ@(2/(m)) + Z@(Z/m) is not exact for i the injection of mZ in Z. Note that this shows that 


M ® and ® M need not be exact functors. 


3. Use Proposition 3.2 and Theorem 3.15 to show that if m and n are positive integers, then 
(2/(m)@(Z/(n)) = Zd) where d = (m, n) the g.c.d. of M and n. 


4. Generalize exercise 3 to R a p.i.d. Use this and the structure theorem for finitely 
generated abelian groups (BAI, p. 195) to determine the structure of the tensor product 
of two finitely generated modules over a p.1.d. 


5. Let ig, Jp, Ka, p be the canonical monomorphisms of M, into ® M,, of Ng into ® Nz and of 
M, ® NP into ® M, ® Nz. Show that there is an isomorphism of (©™,)@(@N») onto @M.@N, 
sending {Bip into kx, By, 


3.8 BIMODULES 


Let M be a right R-module and let R' = End M. Then M can be regarded as a left R'-module if 
we define fx for f e R', x € M, to be the image of x under the map f. Since f is a 
homomorphism of the additive group of M, we have f(x + y) = fx + fy, and by definition of the 
sum and product of homomorphisms, we have (f+ g)x = fx + gx and (fg)x = fex) iff g e R’. 
Clearly also 1x = x so M is a left R'-module. Also the definition of module homomorphism 
gives the relation 


(55) f (xa) = (fx)a 


forxe M,ae R, fe R'. This associativity connects the given right R-module structure with 
the left R’-module structure of M. More generally we now introduce 


DEFINITION 3.5. Jf R and S are rings, an S-R-bimodule is a commutative group M 
(written additively) together with actions of S and R on M such that M with the action of S 
(that is, the product sx, s € S,x € M) is a left S-module, M with the action of R is a right R- 
module, and we have the associativity condition 


(56) s(xr) = (sx)r 
forallse Sxe Myre R. 


The foregoing considerations show that if M is a right R-module, then M can be regarded in 
a natural way as an R'-R-bimodule for R' = End M. 


EXAMPLES 


1. L Let R be commutative. Then any right R-module M can be regarded also as a left R- 
module by putting ax = xa, a e€ R, x € M. It is often advantageous to regard M as an R-R- 


bimodule with ax = xa. The associativity condition a(xb) = (ax)b is an immediate consequence 
of the commutativity of R. 


2. Again let R be commutative and let y be an automorphism of R. If M is a right R-module, 
M becomes a let R-module by defining ax x(a). This action together with the given right 
action makes M an R-R-bimodule. 


3. Any right R-module can be regarded as a £-R-bimodule by defining mx for m e Z in the 
usual way. Similarly, any left R-module becomes an R-2-bimodule. In this way the theory of 
one-sided modules can be subsumed in that of bimodules. 


4. If R is a ring, we have the left module pR and the right module Rp. Since we have the 


associative law (ax)b = a(xb), we can put the two module structures together to obtain a 
bimodule. 


We write M = Mp to indicate that M is an S-R-bimodule. We define a homomorphism of 
sMp into Np to be a map of M into N that is simultaneously an S-homomorphism and an R- 


homomorphism of M into N. It is clear from the associativity condition (54) that for any s, x sx 
is an endomorphism of Mp (M as right .R-module) and x xr is an endomorphism of sM ifr € R. 


Given ;Mp and 7Np one is often interested in the homomorphisms of M into N as right R- 
modules. The set of these is denoted as homp(;Mp, 7Np), which is the same thing as homg(M, 
N). This is an abelian group. Since for s € S, x sx is an endomorphism of Mp, the composite 


map fy: 
(57) (fs)x = f (sx) 


is a homomorphism of Mp into Np if fe homg(M, N). Similarly, ift € T, then if we follow f by 
the map y ty, y € N, we obtain a homomorphism of Mp into Np. Thus tf defined by 


(58) (tf x = (fx) 


is in homp(Mp, Np). It is clear from the associative and distributive laws for homomorphisms 


of modules that we have Kf +h) = htt, 
(+= tft taf, (t= otf), (tbs = fis +S, f(s) +52) = fs, +f > 


f (S152) = (f5,)82 if fre homg(M g, Ng), s.s:E€S, ttEeT, Also we have 1f =f = fl and (tfs = t(fs). We therefore 
have the following 


PROPOSITION 3.4. The abelian group homp(sMp, Np) becomes a T-S-bimodule if we 
define tf and fs for f € homg( Mpg, tNpg), s € S, t € T, by 


(tf)x = (fx), — (fs)x = f (sx), 


xe M. 


Ina similar manner, in the situation pMs, rN; we obtain the following 


PROPOSITION 3.5. The abelian group homp(pMs, pN7) becomes an S-T-bimodule if we 
define 


(59) (sf)x =f(xs), fA) =f 


We leave the proof to the reader. 
An important special case of Proposition 3.4 is obtained by taking the dual module M*¥ = 
hom (Mp, Rp) of a right R-module M. We showed at the beginning of our discussion that the 


right-module M can be considered in a natural way as an R’-R-bimodule for R’ = End M. Also 
R = pRp. Hence, by Proposition 3.4, M* is an R-R’-bimodule if we define 


(60) (ry*)x =r(y*x),  (y*r)x = y*(r'x) 


forre R,r'e R',y*e M*,xe M. 
We now consider tensor products for bimodules. Given Mpg and pN; we can form the 
tensor product with respect to R of the right module M and the left module N. Then we have 


PROPOSITION 3.6. The tensor product sMg ® pNris an S-T-bimodule if we define sz = (s 
®© 1)z and #=418t) for z€sM:®:Nr., Here s is the endomorphism x ~ sx of Mp and t is the 
endomorphism y ~ yt of pN. 


The verification is immediate and is left to the reader. 


Propositions 3.4, 3.5, and 3.6 show one advantage in dealing with bimodules rather than 
with one-sided modules: Tensoring or “homing” of bimodules having a common ring in the 
right place yields bimodules. It is interesting to see what happens when one iterates these 
processes. A first result of this sort is an associativity of tensor products, which we give in 


PROPOSITION 3.7. We have an R-U-bimodule isomorphism of 
(pM s@sNy)@rPy onto tMsOGN 1O Po) such that 


(x®@y)@z > x®(y@z) 
forxe M,ye N,ze P. 
Proof. For s € S we have 
XS@(y@z) = x®s(y@z) = x@(s@1) (y@z) = x@(sy@z). 


Hence for fixed z, /(%¥) = *®W®2) is a balanced product of M and N. Hence we have a group 
homomorphism of M ® N into M @(N ® P) sending *® ~*®@U®2), This maps Lx @y,~ Lx, @,@=) . 
Now define /(2*:®».2)= 2, @ ®2), This defines a balanced product of M @ N as right 7-module 
and P as left 7-module. Then we have a group homomorphism of (M @ N)® P into M &(N 8 P) 


such that (x®»)@= ~ x@(v@z), It is clear that this is, in fact, an R-U-bimodule homomorphism. In a 
similar manner, we can show that we have a bimodule homomorphism of M @(N ® P) into (M 
© N)® P such that *®W@=)~(x@y)@:. It is clear that composites in both orders of the two 
homomorphisms we have defined are identity maps. Hence both are isomorphisms. O 


We now consider the bimodules M = «Ms, N =sNr,P=vPr, Then M ® N is an R-7-bimodule 
and since P is a U-T-bimodule, hom (M ® N, P) is a U-R-bimodule by Proposition 3.4. Next 


we note that hom;(N, P) is a U-S-bimodule and hence hom, (M, hom,(N, P)) is a U-R- 
bimodule, again by Proposition 3.4. Now let f e hom{M ® N, P). Then f : y f(x 8 y) € 
hom,(N, P) and x f, e hom, (M, hom,(N, P)). We have 


PROPOSITION 3.8. “= «Ms, N =sNr,P = uPr , then the map 9 :f of) of hom,(M @ N, P) such 
that (f) is xf, x € M, where f, is y f(x y), y € Nis an isomorphism of hom {M ® N, P) 
onto hom,(M, hom,(N, P)) as U-R-bimodules. 


Proof. Direct verification shows that g is a U-R-bimodule homomorphism. In the other 
direction, let g e hom,(M, hom,({N, P)). Then if x € Mandy e M, g(x)(y) € P. Ifs € S, we 


have AxsXy) = (lxsy) = gsy), Since g(x)(y) is additive in x and y, this defines a balanced 
product of M, and sN. Hence we have a group homorphism f of M ® N into P such that f(x ® y) 


= g(x)y. Replacing y by yt and using the uniqueness of the corresponding f we see that f is a T- 
homomorphism. Put y(g) = f Then the definitions show that gy(g) = g and wo(f) = f. Hence ø 
is an isomorphism. O 


EXERCISES 


1. Let M be an R-R-bimodule and let S = R x M endowed with the abelian group structure 
given by these structures in R and M: t.x)+(.x') = +r.x+x), Define a product in S by 


ELN = (rr rx’ FX): 


Verify that this defines a ring with unit 1 = (1, 0) containing the subring of elements of 
the form (7, 0) and the ideal of elements of the form (0, x). These can be identified with 
R and M respectively. Then S = R ® M and MÊ = 0. 


2. Let M be an R-S-bimodule for the rings R and S. Form the ring T= R @ S. Show that M 
becomes a 7-7-bimodule by defining for (r, s)€ R@®Sandxe M 


(r,s)x = rx, XPS) = XS; 


Apply the construction in exercise 1 to define a ring determined by R, S and M as a set 
of triples (r, s, x). 


3. Specialize Proposition 3.8 by taking R = T= U =Z to obtain an isomorphism of homz (M 


® N, P) onto hom (M, homz (N, P)) for “= 2M» N =sNz, P = z Pz. Use this to show that the 
functor M ® p from R-mod to Z-mod (or Ab) is a left adjoint of the functor homz (M, —) 
from Z-mod to R-mod. 


4. Let S be a subring of a ring R. Then any right R-module becomes a right S-module by 
restricting the action to S. Homomorphisms of right R-modules are homomorphisms of 
these modules as right S-modules. In this way one obtains a functor F from mod-R to 
mod-S (“restricting the scalars to S”). Show that F has both a left and a right adjoint. 
(Hint: Given a right S-module M, consider M ®,R where R is regarded as left S-module 


in the obvious way. Consider also hom,(R, M).) 


3.9 ALGEBRAS AND COALGEBRAS 


We shall now specialize the theory of tensor products of modules to the case in which the ring 
is a commutative ring K. Considerably more can be said in this case. In particular, tensor 
products of modules over K provide an alternative definition of an algebra that in many ways 
is preferable to the one we gave originally (p. 44). For one thing, the new definition suggests 
the dual concept of a coalgebra. Moreover, it facilitates the definition of tensor products of 
algebras. Both of these concepts are of considerable importance. We shall give a number of 
important constructions of algebras, notably, tensor algebras, exterior algebras, and symmetric 
algebras defined by K-modules. 

If K is a commutative ring, then K is isomorphic to its opposite ring K°?. Hence any left 
(right) K-module can be regarded as a right (left) K-module by putting kx = xk, k e K. Thus we 
need not distinguish between left and right modules; we shall simply speak of modules over K. 
Another special feature of modules over a commutative ring is that for a fixed / € K, the scalar 
multiplication x ~ lx is an endomorphism. This is an immediate consequence of the 
commutative law. 

It is also sometimes useful to regard a module over K as a K-bimodule in which the two 
actions of any k € K on the module M coincide: kx = xk. Again, the commutativity of K insures 
that the conditions for a bimodule are satisfied. 

If M and N are K-modules, we can regard these as K-K-bimodules and form M ® ,N, which 


is again a K-K-bimodule. Ifk € K,x € M, ye N, then 
k(x®y) = kx@y = x@ky = (x@y)k. 


In view of this relation, we may as well regard M ® gN simply as a K-module (left or right). 
The map (x, y) + x @y ofM x N into M @ Nis K-bilinear in the sense that for fixed x, y ~ x @ 
y is a K-homomorphism and for fixed y, x + x ® y is a K-homomorphism (cf. BAI, p. 344). 

Now let f :(x, y) + f(x, y) be any K-bilinear map of M x N into a K-module P. Then we 
have 


Bl. f(xy + xy) = f(xy) + f(x) 
B2. f(x yi + Yo) = f(x,y) +(x, ya) 
B3. fikx, y) = kf(x, y) = f(x, ky) 


for ke K,xe M ,y e N. Since kx = xk, ke K,x € M, Bl, B2, and B3 imply that (P, f) is a 
balanced product of M and N. Hence we have the additive group homomorphism of M ® gN 
into P such that x ® y ~ f(x, y). Evidently, by B3, this is a K-module homomorphism. 
Conversely, if z is a K-homomorphism of M ® gN into a K-module P, then /(.») = x8») is a K- 
bilinear map of M x N into P. 

In particular, f(x, y) = y ® x defines a K-bilinear map of M x N into N ® M. Hence we have 
a K-homomorphism o of M @ N into N ® M such that x 8 y ~ y ® x. Similarly we have a K- 
homomorphism o’ of N ® M into M ® N such that y ® x ~ x ® y. Then o' =o !, soa is an 
isomorphism. Taking V = M we obtain the automorphism of M ® M such that x 8 y ~ y@x. 

We recall also that we have the associativity of tensor multiplication in the sense that if M, 
N, and P are K-modules, then we have a K-isomorphism of (M @ N)® P into M @ (N ® P) such 
that (*@y)@z ~ x@(y@z) for xeM forxe M ,y € N,z e P (Proposition 3.6, p. 135). 

We can now give the alternative definition of an algebra over K: 


DEFINITION 3.6. Jf K is a commutative ring, an (associative) algebra over K (or a K- 
algebra) is a triple (A, a, €) where A is a K-module, x is a K- homomorphism of A ® A into A, 
and e is a K-homomorphism of K into A such that the following diagrams are commutative 


A@A@A 7@l A@A 
1@a Gi 
A@A = A 


KOA AOK 


Here we have written A ® A @ A for either (4@4)@4 or A@(A@A), which we identify via the 
isomorphism mapping (*®y)@z~ x@(y@z), x,¥,.2€4, and z @ 1 is the K-homomorphism applied to 
(A @ A) @ A while 1 ® zis applied to A ® (A @ A). The unlabelled arrows are the canonical 
isomorphisms sending k ® x, k e K, x € A, into kx and x ® k into xk = kx. 

Suppose we have an algebra according to this definition. We introduce a product 


(61) xy = m(x@y) 


in A and we define 1 in A by 
(62) gl = L 


Let x, y, z € A and apply the commutativity of the first diagram to (*®»)®@z = *®@(v@2) in A @ A @ 
A. This gives the associative law (xy)z = x(yz). If we take the element k ® x in K ® A and apply 
the commutativity of the left-hand triangle in the second diagram, we obtain 
kx = x(e@1)(K@x) = a(kK1@x) = (k1)x, Similarly, the right-hand triangle gives x(K1) = kx. It follows 
that 1x = x = xl and we have the algebra condition key) = (kxly =x(ky) by the associative law 
applied to kl, x, and y. 

Conversely, suppose we have an algebra in the first sense. Then the given product xy is K- 
bilinear, so we have a K-homomorphism z of A ® A into A such that z(x ® y) = xy. We define 
exK—+A by k~kl, Then we have the commutativity of the first diagram by the associative law 
and of the second by kx = (k1)x = x(k1), which follows from k(xy) = (Ax)y = x(ky). 

Thus the two definitions are equivalent. 

We have given a number of examples of algebras over fields in BAI, pp. 406—409 and pp. 
411—414. We shall now give some examples of algebras over an arbitrary commutative ring K. 


EXAMPLES 


1. EndgM. Let M be a K-module, A = End M. We know that this has a ring structure with the 
natural definitions of addition, zero, negatives, product, and 1. Also Æ is a K-module and k(fg) 
= (kf)g = Akg) for k e K, f, g € A, since applying these to any x e M gives MU@=) (Wigs, Ulka), 
All of these are equal by the definition of (k/)x = f(kx) = k(fx). Hence we have an algebra over 
K in the old sense. 


2. Tensor algebras. Before giving our next construction, we need to consider a general 
associativity property of tensor products of K-modules. Let M,, M),..., M, be K-modules. We 


define “:®°'®™. inductively by “:2@°°2™: = (",@--@M,.)@M,. Also if x; e M.. we define %8 8x 
y bY j J’ 


inductively by +87 8x = (@~@x,.)@%, Now we claim that if 1 < m < n, we have a unique 
isomorphism z,,, „ Of (Mi:®@M,)@(M,-:@°@M,) onto Mi®~@M, such that 


(63) Ton nl 1D OX) @(Xm+ 1 OX) = X41 OXm@m+ 1B" OX 


This is immediate by induction on  — m using the isomorphism ¥®%)®?=M@(N@P) given in 
Proposition 3.7. 

Now suppose all of the M, = M, a given K-module, and put M" = M@M times, 7 = 1, 2, 3, 
.... Also put M® = K and define mos KOM" -~ M", n= 0.1... to be the canonical isomorphism 
mapping k ® x into kx. Define z, , as the canonical isomorphism sending x ® k into kx = xk. 
Now form the K-module 


(64) T(M) = @ M® =K@M@M29- 
0 


We can modify the definition of z,, ,, 0 < m < n, n = 0, 1,2,... so as to regard this as a K- 


homomorphism into 7(M). Using the fact that e(“”"@™™”) is the coproduct of the modules MO ® 
M”) we obtain a homomorphism z’ of ® (MY ® MY) into T(M), which coincides with z,, , on 
M“@M°"-" (identified with their images in ew”em™)). We also have an isomorphism of 
T(M)@T(M) = (®M")@(@M") onto @(M"@M") (exercise 5, p. 133). Hence we obtain a homomorphism z 
of (M) ® T(M) into T(M), which is found by following this isomorphism with the 
homomorphism z’. Now let € be the injection of K into NM) = ® M®. We claim that (7(M), z, 
£) is an algebra in the sense of Definition 3.6, or equivalently, if we put xy = z(x ® y), we have 
the associative law and 1x = x = x1. The second of these is clear. To prove the first, we note 
that any element of T(M) is a sum of terms of the form k1 and elements of the form *:®°®*=, so 
it suffices to prove associativity of multiplication of terms of these forms. Then the general 
associativity will follow by the distributive law. Associativity of products, one of which is £1, 
is clear since the definition of the product gives (kl)x = kx = x(k1). Now consider 
X= XO Olm V= NO Om = 2B Sz, mnp>O, The definition of z gives 


(xy)z = ((x,®°: 'OXm (VD OYn) (21 @"**@z,) 
= XO AXDI O OVAA OZ) 
=X ®&: ' ‘@xn®y; ®): ' LOAEC l '®Zp 


In a similar fashion, one obtains the same result for x(yz). Hence our definitions do give a K- 
algebra. This is called the tensor algebra T(M) defined by the given K-module M. 

We now consider the forgetful functor F from the category K-alg to the category K-mod. We 
claim that (T(M), i) where i is the injection of M in 7(M) constitutes a universal from M to F. 
To see this, let f be a K-module homomorphism of M into a K-algebra A. We define a K- 
homomorphism f” of M” into A, n = 1, 2,... by f? = f, and is the homomorphism of MO = 
M(~') ® M obtained by composing f“~ À) ® f with the homomorphism of A ® A into A, sending 
x ® y into xy. Then for x; e M we have 


(65) SOD BX) =F (Xq) S) 


We define f: K — Aas k ~ kl and we let f* be the K-module homomorphism of T(M) into A 
that coincides with /™ on M™, n = 0, 1,2, It is immediate from (65) that f* is a K-algebra 
homomorphism. Now it is clear from the definition of T(M) that (M) is generated by M. 
Hence f* is the only homomorphism of T(M) into A that coincides with f on M. Thus (7(M), i) 
is a universal from M to the forgetful functor from K-alg to K-mod. 

An algebra A is called graded (by the natural numbers 0, 1,...) if 4= @*4 where A; is a 
submodule of A and 4;4; C A; , ;. The submodule 4; is called the homogeneous part of degree i 
of A. An example of this sort that should be familiar to the reader is obtained by taking 4 = 
K[A,,.-., A], the polynomial algebra over K in indeterminates 1,,..., 4, (BAI, pp. 124-126). 


Let A; be the set of homogeneous polynomials of (total) degree 7 (BAI, p. 138). Then these 
provide a grading for A = K[A,,..., 4,]. The tensor algebra T(M) is graded by its submodules 
MÔ since we have T(M) = ® MÖ and MMY = MMY = MY”, 

The tensor algebra is the starting point for defining several other algebras. Perhaps the most 
important of these are exterior algebras and symmetric algebras, which we shall now define. 


3. Exterior algebras. We define the exterior algebra E(M) of the K-module M as 


(66) E(M) = T(M)/B 
where B is the ideal in 7(M) generated by the elements 
(67) X®x, 


x € M. It is clear from the definition that the elements in the ideal B are contained in L»:™". 
Hence B N M = 0 and the restriction to M of the canonical homomorphism of 7(M) onto E(M) 
= T(M) / B is injective. Then we can identify M with its image and so regard M as a subset of 
E(M). It is clear that M generates E(M). It is clear also that since the ideal B is generated by 
the homogeneous elements x ® x, B is homogeneous in the sense that B= ¥ B® where B® = B 
MM, It follows that E(M) is graded by the subsets EY = (M + B) / b, that is, we have E(M) 
= @ EV) and BEM Bt, 

Now suppose 4 is a K-algebra and f is a K-module homomorphism of M into A such that 
f(x)? = 0 in A for every x e M. The universality property of T(M) gives a homomorphism /* of 
T(M) into A, extending the given map f of M. Since f*(x)* = f(x)* = 0, x e M, the kernel of f* 
contains every x ® x, x e M. Hence it contains the ideal B defining E(M). Then we have a 
homomorphism / of E(M) into A sending x (as element of E(M)) into f(x). In other words, any 
K-module homomorphism fof M into an algebra A satisfying fx)? = 0, x € M, can be extended 
to a K-algebra homomorphism of E(M) into A. Since E(M) is generated by M, it is clear that 
this homomorphism is unique. Thus E(/) has the universality property that any K-module 
homomorphism f of M into a K-algebra such thatf(x)? = 0 for all x € M has a unique extension 
to an algebra homomorphism of E(M) into A. This property implies that in the case in which M 
is a finite dimensional vector space over a field, the exterior algebra we have defined here is 
the same as the algebra we constructed in BAI, pp. 411—414, using bases. 


4. Symmetric algebras. We start again with 7(M) and we now factor out the ideal C 
generated by the elements of the form 


(68) x@y — y@x 


x, y E€ M. The resulting algebra S(M) = T(M) / C is called the symmetric algebra of the K- 
module M. The arguments used for the exterior algebra show that we may regard M as a subset 


of S(M) which generates S(M) and S(M) is graded by the submodules SY = (MY + C)[C. We 
have xy = yx in S(M) for the generators x, y e M. This implies that S(M) is a commutative 


algebra. Moreover, it is easily seen that S(M) together with the injection of M into S(M) 
constitutes a universal from M to the forgetful functor from the category Comalg of 
commutative algebras to the category K-mod. 


5. Universal.enveloping algebra of a Lie algebra. We recall that a Lie algebra L over a 
commutative ring K is a K-module equipped with a bilinear composition (x, y)~+ [xy] that 
satisfies the two conditions 


[xx] =0 
[ssl] + (Loeb) + [Lexy] =0 


If A is an associative algebra, A defines a Lie algebra 4” whose K-module structure is the same 
as that of A and whose Lie product is [xy] = xy — yx (BAI, p. 432). If 4; and A, are associative 
algebras, a homomorphism of A, into A, is also a homomorphism of A, into 4, . In this way 


we obtain a functor F from the category K-alg of associative algebras over K to the category 
K-Lie of Lie algebras over K. We proceed to show that given any Lie algebra L there exists a 
universal from L to the functor F. To construct this, we form the tensor algebra 7(L) for the K- 
module 4: Tit) = KL" and we let B be the ideal in 7(L) generated by all of the elements of 
the form 


[xy] -—x®@y+y@x, x,yeL (=W). 


Put U(L) = T(L) / B and let u be the restriction to L of the canonical homomorphism of 7(L) 
onto U(L) = T(L) / B. Then u is a homomorphism of the Lie algebra Linto U(L). We call U(L) 
the (universal) enveloping algebra of the Lie algebra L. We claim that (U(L), u) constitutes a 
universal from Lto the functor F. 


We have to show that if A is any associative algebra and g is a homomorphism of L into £, 
then there exists a unique homomorphism 9 of associative algebras, #:¥(4)-4, such that the 
following diagram of Lie algebra homomorphisms is commutative 


L u UL- 


A- 


We have seen that we have a unique homomorphism g* of T(L) into A extending the K- 
homomorphism g of L into A. Now sly) = ga) = gox) for x, y € L, since g is a homomorphism 
of L into ÆA. Hence 


g*([xy] — x®y + y@x) = gl[xy]) — glxdgly) + gly)g(x) = 0. 


which implies that B c ker g*. Hence we have a homomorphism 9 of U(L) = T(L) / B into A 
such that %*°+8)= 9s) erg) =99 | Thus ð makes the preceding diagram commutative. Since L 


generates 7(L), u(L) generates U(L). Hence f is unique. 

We remark that the result we have established: existence of a universal to F for every Lie 
algebra is equivalent to the statement that the functor F from K-alg to K-Lie has a left adjoint 
from K-Lie to K-alg. A similar remark can be made for symmetric algebras. We observe also 
that a symmetric algebra of a K-module M can be regarded as the enveloping algebra of the Lie 
algebra M in which the composition is the trivial one: [xy] = 0. 


We consider next the tensor products of associative algebras. Let (4; 7; €), i = 1, 2, be 


algebras in the sense of Definition 3.6. Then *'4@4:~ A» 6:K->4, and we have the 
commutativity of the diagrams given in the definition. Now consider the K-modules 4, ® A, 


and (4:@42)@(4,@42), Using the fact that we have an isomorphism of A, ® A, onto A, ® A, 


sending every 2@*:1**:@*%» %€4i and the associativity of tensor products, we obtain an 
isomorphism of (4:942)@(4,@42) onto (A; @ A,) 8 (A, 8 A) such that 


(X;@X2)@(Vy @Yo) > (X; @Y1)@(X2@ yo). 


Following this with z, ® m, we obtain a K-homomorphism z of (4:942)@(4,@42) into A, ® A, 
such that 


(X; @X2)@(V Ay) > My (Xj OY) ATX). 


In other words, we have a product composition in A, ® A, such that 


(69) (X1 @X2\V1 @Y2) = X1V1 @Xaye2 
in terms of the products in A, and A,. Also put 
(70) | = 18l; 


Then for x;, Y; z; € 4; we have 


((X; @Xa)(V; @V2))(Z, @Z2) = (%1)1)21 @(X2V2)22 
(X;@X9)((V @Y2)(Z; @2Z2)) = X4(V121)@X2(V 229). 


This implies the associativity of the multiplication in A = A, ® A. We have 


I(x; 8x) = x,@x, = (x, @x,), 


which implies that 1 is the unit under multiplication. Hence Æ is a K-algebra. We call A= A, @ 
A, the tensor product of A, and A). 

The tensor product of algebras is characterized by a universal property, which we proceed 
to describe. We note first that the maps x, ~ x; @ 15, x2 ~ 1, ® x, are homomorphisms of A, 
and A, respectively into A, ® A,. (These need not be injective.) We have 


(71) xL AX) = x, 8x = (1,@x,)(x,@1,), 


that is, if 61X: + *1@lz and ez:x2 ~ 1,@xz | then e&s(*idea(%2) = ealxzdel%) for all x, € Aj, x. € A>. Now 
suppose f; is a homomorphism of 4, i = 1, 2, into an algebra B such that A&M) = 
hai), x; € A; Then we claim that we have a unique algebra homomorphism f of A, 8 A, 
into B such that fe; = f; To prove this we define a map (x), x2) f(x )fo(x2) of A, x A> into B. 
This is K-bilinear, so we have a K-module homomorphism f of A, ® A, into B such that 
fix, ®xy) =A) | Since AD =/0,81) =f All) = 1 =1 and 


fx XY Q V>)) = f(x,y; XV) 


= fiA DADS O Daa) = f(x Qx) (y1®y2), 


f is a K-algebra homomorphism. We have fils) 5/081) SAADA) = how and fex) = hx), 
so fe; = f; The relation (71) and the fact that every element of 4, ® A, is a sum 2i*1/@%2p WE Ai 
imply that f is the only homomorphism of A, ® A, satisfying fe; = f;. 

Tensor multiplication of algebras is associative and commutative in the sense of the 
following 


THEOREM 3.16. We have an isomorphism of A, ® A, onto A, ® A, mapping x, ® x, into x, 
® x, and an isomorphism of (A, ® A,)® Az onto A, ® (4 @ A) mapping 
(xxx; into x, @(x,@Xs), HEA, 


Proof. We have seen that we have module isomorphisms of the form indicated. Direct 
verification shows that these are algebra maps. o 


We have observed several instances in which the use of Definition 3.6 for an algebra is 
advantageous. Perhaps its most important advantage is that this definition suggests a dual 
concept. We present this as 


DEFINITION 3.7. A coalgebra over K (or K-coalgebra) is a triple (C, ô, a) such that C is a 
K-module, ô is a K-homomorphism C —> C ® gC, and a is a K-homomorphism C — K such 


that the following diagrams are commutative: 


C@®C@eC 501 CEE 


COC 3 a 


c@ec 


Kec COK 


where the maps C —> C ® K and C+K@C are x~>x®1l and x~>1®x, The map ô is called a 
diagonalization and a is called an augmentation. 


It is nice to have a pretty definition, but it is even nicer to have some pretty examples. Here 
are a couple for coalgebras. 


EXAMPLES 


1. Let G be a group and let K[G] be the group algebra over K of G (see BAI, p. 127 
(exercise 8) and p. 408). This has G as module base over K and the product in K[G] is given 
by the formula SA9 ©he) = 2k where k, l, € K, s; t; € G. As usual, we identify the element 
ls, s € G, with s and so regard G as imbedded in K[G]. The fundamental property of K[G] is 
that if g is a homomorphism of G into the multiplicative group of invertible elements of an 
algebra A, then g has a unique extension to a homomorphism of K[G] into A. Now take 
A = K[G]®xK(6), It is clear that the map s ~ s ® s of G into A is a group homomorphism, so this 
has an extension to an algebra homomorphism ô of K[G] into K[G] ® K[G]. We also have a 
unique algebra homomorphism a of K[G] into K extending the homomorphism s ~ 1 of G into 
the multiplicative group of K. Then (K[G], ô, a) is a coalgebra. We leave it to the reader to 
verify the conditions in the definition. 

2. Let U(L) be the enveloping algebra of a Lie algebra L. If u is the map of Linto U(L), then 
we have "Lyp = Leekw) (= webduly) — yw) for x,y € L. Put “4*0! + l@uMeUIL@UIL) Tt follows 
from the commutativity of any a ®1 and 1 ® b, a, b e U(L) that 


[u;(x) u(y)] = u([xy]) 


SO Uy is a homomorphism of L into U(L)® U(L). This has a unique extension to a 
homomorphism 6 of U(L) into U(L)® U(L). Similarly, we have a homomorphism a of U(L) 
into K such that u(x) ~ 0 for all x e L. We leave it to the reader to verify that (U(L), ô, a) is a 
coalgebra. 


Both of the preceding examples are algebras as well as coalgebras. Moreover, the maps ô 
and a are algebra homomorphisms. In this situation, the composite system (C, z, €, 0, a) is 
called a bialgebra. Bialgebras of a special type, now called Hopf algebras, were introduced 
by Heinz Hopf in his studies of the topology of Lie groups. 


EXERCISES 


— 


. Let M and N be vector spaces over a field K, M* and N* the dual spaces. Verify that if f 


e M*, ge N*, then (x, y) ~ f(x)g(y) is a bilinear map of M x N into K Hence there is a 
unique h e (M @ N)* such that "*®») =f), Show that there is a vector space 
monomorphism of M* ®© N* into (M @ N)* such that f® g ~ h and that this is an 
isomorphism if M and N are finite dimensional. 


. Let M and N be finite dimensional vector spaces over a field K. Show that there is an 


isomorphism of Endy M ® x EndxN onto End,(M ® kN) such that f ® g for fe EndgM 
and g € End;,N is mapped into the linear transformation of M ® N such that *®y ~/M@9). 
Does this hold if M or N is infinite dimensional? 


. Let (A, 24, £4) and (B, zp, €p) be algebras over K. Show that a K-homomorphism 8 : A 
— B is an algebra homomorphism if and only if the diagrams 
A ma AQA 
j (ROR) a K 
B Wa B@B 


commute. 
Dualize these diagrams to obtain a definition of homomorphism of coalgebras. 


. Let (C, z, €) be an algebra and (C, ô, a) be a coalgebra. Show that (C, z €, ô, a) is a 


bialgebra if and only if z and ¢ are coalgebra homomorphisms. 


. Let (C, ô, a) be a coalgebra over a field K and let p denote the monomorphism of C* @ 


C* into (C @ C)* given in exercise 1. Put z = 6*p where **:(C@C)"~C* is the transpose of 
the map 6: C— C ® C (p. 21). Let e = a*i where i is the canonical isomorphism of K 
onto K* = hom (K, K) and a* is the transpose of a. Verify that (C*, x, €) is an algebra. 
This is called the dual algebra of (C, ô, a). 


. Let (A, z, €) be a finite dimensional algebra over a field K. Let t be the isomorphism of 


(A ® A)* onto A* ® A* given in exercise 1. Put ô = tz* where **:4*=(484)" is the 
transpose of z and a = i` !e* where i is as in exercise 5 and é* is the transpose of e. 
Verify that (A*, ô, a) is a coalgebra. This is called the dual coalgebra of A. 


. Let C(n) have the base {x, | 1 <i, 7 <n} over a field K. Let ô and a be the linear maps 


such that %*y) = ix: *a@%y and ay) = Oy | Verify that (C(n), ô, a) is the dual coalgebra of 
M,,(K). 


. If (A, z, €) is an algebra, a left A-module is defined as a K-module M and a map 


w:AQ@,M—>M such that 


10. 


11. 


12. 


KOM A@A@QM lOp AOM 


€@!1 M and Ol B 


commute. 
Dualize to define comodules for coalgebras. 


. An algebra A is called filtered if for every i = 0, 1,2,... there is defined a subspace AY 


such that © 4® = 4% if i<j, Gi) UA" =A, fii) A®A c AY** | Show that if X is a set of generators of 
A, then A is filtered by defining 4° =K1+KX+KX*+:+KX' where K Yis the K-submodule 


spanned by the set Y and X is the set of products ofj elements in X. 


Let A be a filtered algebra and define (4) = 864 where A, = 4/4" for i > 0, Ay = AO. Show 


that G(A) can be given the structure of a graded algebra in which 
(a + AN (a; + A) = aay + AT | GCA) is called the associated graded algebra of the 
filtered algebra A. 


If K is a commutative ring, let A= K[A]/ (47), A an indeterminate. Show that 1 and the 


coset d= å + (4°) forma K-base for A and d’ = 0. A is called the algebra of dual 
numbers over K. Let A be a K-algebra and form the algebra A ® gA. Note that this is a 


right „A-module in which CS» = 6@xy for beA, xyeA. Show that 1 (=1@1)andd=d®@1 
forma base for A @ A over A and we have 


(x, + dy,)(x, + dyz) = x,x, + d(x, y2 + y,x3) 


where x; = 1 - x; and x; y; € A. Show that a K-endomorphism D of A is a derivation, that 
iS, 

D(xy) = (Dx)y + x(Dy) 
for x, y € A if and only if the A-endomorphism of A ® 4 such that x ~ x + d(Dx), x € A, 
is an automorphism of A ® gA Show that if a e A, then / + da is invertible in A ® xA 
with inverse 1 — da and (!+da)x{l-da) = x+dlax—xa), Hence conclude that x + [a, x] = ax — xa 
is a derivation. This is called the inner derivation determined by a. 
Generalize exercise 11 by replacing A by 4” = KIG. »>2, to obtain A” ® pA, which is 
a right A-module with base (1, d, d’,..., d’~ 1) where d =A + (4”) and d is identified 


with the element d ® 1 of A @ A. Call a sequence 9 = (P= 1.Di..--D,-1) of K- 
endomorphisms of A a higher derivation of order n if 


Di(xy)= ¥ (Djx)(Dyy) 


jtk=i 


for x,y E€ A,0 <i<n-—1. Show that D = is a higher derivation of order n if and only if 
the A-endomorphism of A ® A such that 


x>x+d(D,x)+d?(D,x)+:+:+d"" '(D,_ 1x) 


is an algebra automorphism. 
13. Let M and N be modules over a commutative ring K, A and B algebras over K, K' a 
commutative algebra over K, K" a commutative algebra over K'. Write Mg. = K'® gM 


etc. Note that K" is an algebra over K in which kk" = (A1)k" for ke K, k" e K" and 1 is 
the unit of K’. Prove the following isomorphisms: 


(i) MOr Ng =(M@qN)p. as K'-modules, 
(ii) Ap @p By = (A®@B)y. as K’-algebras. 
(iii) (Mg)ge = Mx. as K"'-modules. 

(iv) (Ag) = Agr as K”-algebras. 


3.10 PROJECTIVE MODULES 


In this section we shall introduce a class of modules that are more general than free modules 
and that, from the categorical point of view, constitute a more natural class. We recall a basic 
homomorphism lifting property of free Q-algebras: If F is a free Q-algebra and Å is a 
surjective homomorphism of the Q-algebra A onto the Q-algebra A, then any homomorphism / 
of F into A can be “lifted” to a homomorphism f of F into A, that is, there exists a 
homomorphism f of F into A such that J = Af (p. 80). A similar result holds for free modules. If 
we have a diagram 


>j 


M N fr 


where F is free, we can complete it by an f: F — M to obtain a commutative diagram. The 
proof is the same as for Q-algebras: Let X be a base for F. For each x € X choose an element u 
€e M such that Au = Jx and let f be the homomorphism of F into M such that fx = u. Then 4fx = 
Îx for all xe X so Af= J, as required. 

We single out this property of free modules as the basis for the following generalization. 


DEFINITION 3.8. A module P is called projective if given any diagram 


f 
M a N 
there exists a homomorphism g: P > M such that 
P 
S f 
M p N 


is commutative. In other words, given an epimorphism p:M — N, then any homomorphism f: 
P — N can be factored as f = pg for some g: P — M. 


We recall that for any module M, the functor hom (M, —) is left exact. Hence if 
(72) 0+N'+N4N"30 
is exact, then 


0> hom (M, N’) ee, hom (M, N) ee?) hom (M, N”) 


is exact. Now suppose M = P is projective. Then given f e hom (P, N") there exists a g € 
hom(P, N) such that hom (P, p) (€) = pg = f. Thus in this case, hom (P, p) is surjective and so 
we actually have the exactness of 


0 = hom (P, N’) ee, hom (P, N) on”, hom (P, N”) > 0 


as a consequence of the exactness of (72). Hence if P is projective, then hom (P, — )is an exact 
functor from mod-R to mod-Z. 

The converse holds also. Suppose hom (P, —) is exact and suppose M = N. Let K = ker p. 
Then we have the exact sequence 0+K 4M %N--0 where i is the injection map. Applying the 


exactness of hom (P, —), we obtain the property stated in Definition 3.8. We have therefore 
established the following categorical characterization of projective modules. 


PROPOSITION 3.9. A module P is projective if and only if hom (P, —) is an exact functor. 
How close are projective modules to being free? We shall give some answers to this 
question. First, we need a simple result on short exact sequences 


(73) 03M’5M3M"30. 


Suppose that for such a sequence there exists a homomorphism 7’: M" — M such that pi’ = 1yr. 
Then Pt!w-'p)=e-e =. This implies that there exists a p’: M — M' such that 1), — i'p = ip’. For, 
ifx e M then py — i'p)x = 0, so (lọ — i'p)x is in the kernel of p, hence, in the image of i. 
Then we have an x’ e M' such that ix’ = (1), —i'p)x and x’ is unique since 7 is a monomorphism. 


We now have a map p’: x ~ x’ of M into M'. It follows directly that p' is a module 
homomorphism and ix’ = ip'x so ip’ = 1), —i'p or 


(74) i'p+ip’ = l y. 


Also we have “pip =!wp =p, Hence multiplying (74) on the right by i'p gives ip'i'p = 0. Since 
p is surjective and 7 is injective, this implies that p'i' = 0 (on M"). Ifx' € M' then ip'ix'=(1 y- 
i'p)ix' = ix' and since i is injective, we have p'i = ly. Since we had at the outset pi = 0 and pi’ 
= ] y", we have the four relations 


(75) 


These together with (74) imply that M is canonically isomorphic to M' @ M" (see p. 97). 

In a similar manner, suppose there exists a p: M — M' such that p'i = lj. Then 
(u= ip)i=i-ily =i-i= 0, This implies that 1), — ip’ = i'p for a homomorphism i’: M" — M. For, 
if fis any homomorphism M — M such that fi = 0, then ker p =imi c ker f. Hence px ~ fx is a 
well-defined map g of pM = M" into M. Direct verification shows that this is a module 
homomorphism. Evidently, we have gp = f. Applying this to f= 1 jy — ip’, we obtain 7’: M" — 
M such that 1), = ip’ + i'p. Since P= ilwp =", we obtain ip'i'p = 0, which implies p'i' = 0. 
Since Pip = pPllu- ip’) = p-pir =P (by pi = 0), pi'p = p. Since p is surjective, this implies that pi’ = 
1 yr. Again we have the relations (74) and (75), so M = M'®M” canonically. 

We shall now say that the exact sequence (73) splits if there exists an i’: M" — M such that 
pi' = l y or, equivalently, there exists a p": M — M' such that p'i = 1 yf. 

We can now give two important characterizations of projective modules. 


PROPOSITION 3.10. The following properties of a module P are equivalent: 
(1) P is projective. 
(2) Any short exact sequence 0 ~ M—> N— P > 0 splits. 


(3) P is a direct summand of a free module (that is, there exists a free module F 
isomorphic to P ® P' for some P’). 


Proof. (1) = (2). Let 0+m4N4P-+0 be exact and consider the diagram 


P 


Ip 


N P 


d 


By hypothesis we can fill this in with g': P — N to obtain a commutative diagram. Then gg’ = 
lp and the given short exact sequence splits. 

(2) = (3). Since any module is a homomorphic image of a free module, we have a short 
exact sequence 9+? +F “P+0 where F is a free module. If P satisfies property 2, then 
0+P+F%P-0 splits and hence F = P @ P. 

(3) =(1). We are given that there exists a split exact sequence 0= P>F&pP=0 with F free. 
Now suppose we have a diagram 


P 
f 
M ; y 
Combining two daigrams, we obtain 
D— e; — F — P ——— 0 
m 
fp j 


where pi' = 1p (since the top line splits). Since F is free, hence projective, we can fill in g: F 
— M to obtain fp = qg. Then f=S lr =f? = a9? and gi’: P — M makes 


Commutative. Hence P is projective. O 

As a first consequence of Proposition 3.10, we note that there exist projective modules that 
are not free. The simplest example is perhaps the following one. Let F = Z /(6) regarded as a 
module for R = Z /(6), so F is free. Now F = Z /(2) @ zZ /(3). Hence Z /(2) and Z /(3) are 
projective by Proposition 3.10 and evidently these modules are not free 2 /(6)-modules. 

Of particular interest are the modules that are finitely generated and projective. The 
proposition gives the following characterization of these modules. 


COROLLARY. 4 module P is finitely generated and projective if and only if P is a direct 
summand of a free module with a finite base. 


Proof. If P is a direct summand of a free module F with finite base, then P is projective. 
Moreover, P is a homomorphic image of F, so P has a finite set of generators (the images of 
the base under an epimorphism of F onto P). Conversely, suppose P is finitely generated and 
projective. Then the first condition implies that we have an exact sequence 0 — P' > F —> P 
— 0 where F is free with finite base. The proof of the theorem shows that if P is projective, 
then F = P @ P', so P is a direct summand of a free module with finite base. O 


Some rings have the property that all of their projective modules are free. This is the case 
for p.i.d. (commutative principal ideal domains). We showed in BAI, pp. 179-180, that any 
submodule of a free module with a finite base over a p.i.d. D is free. This can be extended to 
prove that any submodule of any free module over D is free. By Proposition 3.10.3, this 
implies that any projective module over a p.i.d. is free. We shall prove later (p. 416) that any 
finitely generated projective module over a local ring is free. Another result of this sort, 
proved independently by D. Quillen and A. Suslin, is that if R = D[A,,..., 4,] where D is a 
p.i.d., then every finitely generated module over R is free. Their work settled a question that 
had been raised in 1955 by J.—P. Serre and that had been unanswered for more than twenty 
years. 

We shall give next an “elementary” criterion for projectivity that is extremely useful. 


PROPOSITION 3.11. A (right) R-module P is projective if and only if there exists a set 
{xa | a € I} of elements P and elements {x,* | a € I} of the dual module P* = hom (P, R) 


such that for any x € P, x,*(x) = 0 for all but a finite number of the x,*, and * = Lx xT) 
(with the obvious meaning of this sum). If the condition holds, then {x,} is a set of 
generators and if P is projective, then {x,} can be taken to be any set of generators for P. 


Proof. Suppose first that P is projective and {x, | a € J} is a set of generators for P. We have 


a free module F with base {X, | a € I} and an epimorphism p: F — P such that p(X,) = xa 
Since P is projective, we have a homomorphism i’: P — F such that pi’ = 1p. Since {X, is a 


base for F, any X can be written as a sum *n4*+*%s.4. 4° +X. 0, 


where the dg, € R. We may 
write also X = 2«*.4, where only a finite number of the a, are # 0 and the sum is taken over the 
a; for which a, # 0. Since the X, form a base, the a, (0 or not) are uniquely determined by X. 
Hence we have the maps X,*: X ~ a, and for a particular X we have X,*(X) = 0 for all but a 
finite number of the a. It is immediate from the definition of X,* that_X,* € F* = hom(/, R). 
Then x,* = X,*i' € P* and for any x € P, #0) = X32") =9 for all but a finite set of a. Ifx e P 
we have i(x) = X e F and X = } X,X,*(X). Since x, = p(X,), we have 
x = pi'(x) = p(X) = Ep(X,)X2 00 = Ex, XIX) = Dx, AF'(x) = Exa x7(x)., Thus {x,,} and {x,,*} have the stated 
properties. Conversely, assume that for a module P we have {x,, {x,*} with the stated 
properties. Again let F be the free module with base {X,,} and let p be the homomorphism F — 
P such that p(X) = xa: For each a we have the homomorphism x ~ X,x,*(x) of P into F. Since 
for a given x, x,*(x) = 0 for all but a finite number of a; we have a homomorphism 7’: P > F 
such that "(x)= 2X,xf0), Then Pi) == PX hots) = Exxx) = x, Thus pi’ = lp, so we have a split 
exact sequence 0+ P’+F4P-+0. Then P is a direct summand of F and hence P is projective. oO 


The result proved in Proposition 3.11 is often called the “dual basis lemma” for projective 
modules. It should be noted, however, that the x, in the statement need not be a base for P. 


The dual basis lemma is usually used to characterize finitely generated projective modules. 
Proposition 3.11 and its proof have the following immediate consequence. 


COROLLARY. 4 module P is finitely generated and projective if and only if there exists a 
finite set of pairs (x; x;*) where x; € P and x;* € P* such that for any x € P we have x = Yx;, 


x;*(x). 


i Xj 


There are a number of important properties of free modules that carry over to projective 
modules. One of these is flatness, which we define in 


DEFINITION 3.9. 4 right module M = Mp is ia ig if for every monomorphism N' 4 N 


of left R-modules we have the monomorphism M@N' 2L MON of £-modules. 


We recall that the functor M ®p from R-mod to Z-mod is right exact (Theorem 3.15, p. 
132). As in the proof of Proposition 3.9, we have the following immediate consequence 


PROPOSITION 3.12. A right module M is flat if and only if the tensor functor M ® p is 
exact. 


The main result on flatness that we shall establish in this section is that projectives are flat. 
The proof will follow quite readily from the following 


PROPOSITION 3.13. M=® M, is flat if and only if every M, is flat. 


Proof. Suppose N'/ N for left modules N’ and N. Our result will follow if we can show that M 
®N M®Nifand only if M, ® N' M, @N for every a. If we use the isomorphisms yy and yy’, 
given in Proposition 3.3 and the naturality given by (54), we see that it suffices to show that we 
have ®(M.@N)‘ @(M,.®N) if and only if M, ® N' M, ® N for every a. This is clear from the 
definition of f*: f* is injective if and only ifevery ly © fis injection. O 


We can now prove 


THEOREM 3.17. Projective modules are flat. 


Proof. Observe first that R = Rp is flat, since we have an isomorphism of R ® N into N 
sending a ® y ~ ay. Hence N' 4 N is injective if and only if R ® N' R @ N is injective. Both 
the flatness of R and Proposition 3.13 imply that any free R-module is flat. Since any 
projective module is a direct summand of a free module, another application of Proposition 
3.13 gives the result that any projective module is flat. O 


The technique of reduction to R by means of a result on direct sums is a standard one for 
establishing properties of projective modules. As another illustration of this method, we sketch 
a proof of 


PROPOSITION 3.14. Let K be a commutative ring, P a finitely generated projective K- 
module, N an arbitrary K-module. Then the homomorphism of EndgP ® EndgN into Endg(P 


®© N) sending f @ g for fe End P, g € End N, into the endomorphism f È g of P® N such that 
(S89) (x@y) =W) is an isomorphism (as K-algebras). 


Proof. It is clear that the homomorphism is an algebra homomorphism. Moreover, it is an 
isomorphism if P = K. Now suppose M = ®i“), Then we have a canonical isomorphism of End 
M = hom (M, M) onto ® ; , hom (M;, M,) and of End M ® End N onto ®(hom(M,, M,) ® End 
N). Also we have a canonical isomorphism of End (M ® N) = hom (M @ N, M @ N) onto @ 
(hom (M; ® N), (M, ®© N)). Using these isomorphisms, we see that the homomorphism of End 
M ® End N into End (M ® N) sending f @ g into f® g, as in the statement of the proposition, is 
an isomorphism if and only if for every (j, k) the same map of hom (M;, M,)® End N into hom 
(M; ® N), (M; ® N) is an isomorphism. Taking the M, = K, we see that the result stated holds 
for P, a free module with a finite base. Then the result follows for finitely generated projective 
P, since such a P is a direct summand of a free module witha finite base. O 


We leave it to the reader to fill in the details of this proof. 


— 


EXERCISES 


. Show that if we have a diagram 


he 


in which the row is exact, P is projective, and hf = 0 then there exists a k: P — M such 
that f= gk. 


. Show that if P}, a € J, is projective, then ® P, is projective. 


. Show that if e is an idempotent in a ring R, then eR is a projective right module and Re is 


a projective left module. 


. Let {e;} be the usual matrix units in M,(A), A a division ring. Show that e,,M,(A) is 


irreducible as right /,(A)-module. Hence conclude that e}, M,(A) is projective but not 
free ifn > 1. 


. Prove that any submodule of a free module over a p.i.d. D is free. (Hint: Extend the 


argument of BAI, p. 179, by transfinite induction or by Zorn’s lemma.) 


6. Show that the additive group of @ regarded as 2-module is flat. 


7. Let R and S be rings. Let P be a finitely generated projective left R-module, M an R-S- 


3.11 


bimodule, N a left S-module. Show that there is a group isomorphism 
n:hom,(P,M)@ N > hom,(P,M@sN) 


such that for fe homp(P, M) and y € N, n(f @ y) is the homomorphism x ~ fx)® y; of P 
into M ® N. 


. (Schanuel’s lemma.) Suppose we have a short exact sequence 0 — N; > P; > M — 0 


where P, is projective and i = 1, 2. Show that P@M: š P©N:, (Hint: Consider the pullback 
of g and g, as constructed in exercise 6, p. 36.) 


INJECTIVE MODULES. INJECTIVE HULL 


The concept of a projective module has a dual obtained by reversing the arrows in the 
definition. This yields the following 


DEFINITION 3.10. 4 module QO is called injective if given any diagram of homomorphisms 


ee M 
(76) f 


Q 


there exists a homomorphism g: M — Q such that the diagram obtained by filling in g is 
commutative. In other words, given f. N —> Q and a monomorphism i: N — M there exists a 
g: M > Ọ such that f= gi. 


With a slight change of notation, the definition amounts to this: Given an exact sequence 0 
— N' > N, the sequence 


hom (i,Q) 


hom (N,Q) hom (N’,Q)—0 


is exact. Since we know that exactness of N’ + N % N" — 0 implies exactness of 


hom (p, M) hom (1,M) 


hom (N, M)——— hom (N', M) 


0— hom (N”, M) 


(Theorem 3.1, p. 99), it is clear that O is injective if and only if the contravariant hom functor 
hom( —, Q) is exact in the sense that it maps any short exact sequence 0 — N' > N— N" — 0 
into a short exact sequence 


0 —> hom (N",Q) + hom (N,Q) > hom (N’,Q) > 0. 


It is easily seen also that the definition of injective is equivalent to the following: If N is a 
submodule of a module M, then any homomorphism of N into Q can be extended to a 
homomorphism of M into Q. Another result, which is easily established by dualizing the proof 
of the analogous result on projectives (Proposition 3.9, p. 149), is that if Q is injective, then 
any short exact sequence 0 — Q — M — N — 0 splits. The converse of this holds also. 
However, the proof requires the dual of the easy result that any module is a homomorphic 
image of a projective module (in fact, a free module). The dual statement is that any module 
can be imbedded in an injective one. We shall see that this is the case, but the proof will turn 
out to be fairly difficult. 


The concept of an injective module was introduced in 1940 by Reinhold Baer before 
projective modules were thought of. Most of the results presented here are due to him; in 
particular, this is the case with the following criterion. 


PROPOSITION 3.15. 4 module Q is injective if and only if any homomorphism of a right 


ideal I of R into Ọ can be extended to a homomorphism of R into Q. 


Proof. Obviously, the condition is necessary. Now suppose it holds and suppose M is a 
module and f is a homomorphism of a submodule N of M into Q. Consider the set {(g, M} 
where M' is a submodule of M containing N and g is a homomorphism of M’ into Q such that g | 
N =f. We define a partial order in the set {(g,M')} by declaring that (g1, M'i) = (e2, M'>) if M'i 
> M’, and g; | M', = g>. It is clear that any totally ordered subset has an upper bound in this set. 
Hence, by Zorn’s lemma, there exists a maximal (g, M’); that is, we have an extension of f to a 
homomorphism g of M' > N which is maximal in the sense that if g} is a homomorphism of an 
M', > M' such that g, | M' = g, then necessarily M,’ = M'. We claim that M' = M. Otherwise, 
there is anx € M, É M' and so xR + M' is a submodule of M properly containing M’. Now let 


(77) I = {seR|xseM’. 


Then / = ann (x + M’) in M / M', so T is a right ideal of R. If S e / then xs e M', so g(xs) € Q. It 
is immediate that the map h : s + g(xs) is a module homomorphism of / into Q. Hence, by 
hypothesis, A can be extended to a homomorphism k of R into Q. We shall use this to obtain an 
extension of g to a homomorphism of xR + M' to Q. The elements of xR + M' have the form xr 
+y,r € R,y € M'. If we have a relation xs + y'= 0, s e R, y' e M' then the definition (77) 
shows that s € 7. Then 


k(s) = h(s) = g(xs) = —g(y’). 
Thus xs + y'= 0 for s € R, y' e M' implies that k(s) + g(y’) = 0. It follows that 
(78) xr+y > k(r)+g(y), 


re R,y e M',is a well-defined map. For, if xr; + y; = xr + yan, r; E€ R, y; E M', then xs + y' = 
0 for s =r, — ra, y'= y; — y2. Then k(s) + g(y’) = 0 and k(r; — r2) + giv; — y2) = 0. Since k and 
g are homomorphisms, this implies that k(r1) + g1) = k(r2) + g(y2). Thus, (78) is single- 
valued. It is immediate that the map rx + y ~ k(r) + g(y) is a module homomorphism of xR + M 
' into Q extending the homomorphism g of M'. This contradicts the maximality of (g, M’). 
Hence M'= M and we have proved that if fis a homomorphism of a submodule N of M into Q, 
then f can be extended to a homomorphism of M into Q. Hence Q is injective. O 


For certain “nice” rings, the concept of injectivity of modules is closely related to the 
simpler notion of divisibility, which we proceed to define. If a € R, then the module M is said 
to be divisible by a if the map x ~ xa of M into M is surjective. A module is called divisible if 
itis divisible by every a + 0. It is clear that if M is divisible by a or if M is divisible, then any 
homomorphic image of M has the same property. In some sense injectivity is a generalization 
of divisibility, for we have 


PROPOSITION 3.16. Jf R has no zero divisors # 0, then any injective R-module is 
divisible. If R is a ring such that every right ideal of R is principal ( = aR for some aé R), 


then any divisible R-module is injective. 


Proof. Suppose R has no zero-divisors + 0 and let Q be an injective R-module. Letx € Q, re 
R,r #0. Ifa, b e R and ra = rb, then a = b. Hence we have a well-defined map ra + xa, a € 
R, of the right ideal rR into Q. Clearly this is a module homomorphism. Since Q is injective, 
the map ra ~ xa can be extended to a homomorphism of R into Q. If 1 ~ y under this extension, 
thenr = Ir + yr. Since r=rl ~ xl =x, we have x = yr. Since x was arbitrary in Q and r was 
any non-zero element of R, this shows that Q is divisible. Now suppose R is a ring in which 
every right ideal is principal. Let M be a divisible R-module and let f be a homomorphism of 
the right ideal rR into M. If r = 0, then f is the 0 map and this can be extended to the 0 map of 
R. If r 4 0 and f(r) =x e M, then there exists a y in M such that x = yr. Then a ~ ya is a 
module homomorphism of R into M and since rb ~ yrb = xb = fir)b = f(rb), a ~ ya is an 
extension of f. Thus any module homomorphism of a right ideal of R into M can be extended to 
a homomorphism of R. Hence M is injective by Baer’s criterion. O 


If R satisfies both conditions stated in the proposition, then an R-module is injective if and 
only if it is divisible. In particular, this holds if R is a p.i.d. We can use this to construct some 
examples of injective modules. 


EXAMPLES 


1. Let R be a subring of a field F and regard F as an R-module in the natural way. Evidently 
F is a divisible R-module. Hence if K is any K-submodule of F, then F / K is a divisible R- 
module. 


2. Let D be a p.i.d., F its field of fractions. If € D, then F / (r) ((r) = rD) is divisible and 
hence is injective by Proposition 3.16. 


Our next objective is to prove that any module can be imbedded in an injective module, that 
is, given any M there exists an exact sequence 0 — M > Q with Q injective. The first step in 
the proof we shall give is 


LEMMA 1. Any abelian group can be imbedded in a divisible group ( = £-module). 


Proof. First let F be a free abelian group with base {x,} and let F” be the vector space over Q 
with {x,} as base. Then F is imbedded in Æ" and it is clear that F” is divisible. Now let M be 


an arbitrary abelian group. Then M is isomorphic to a factor group F / K of a free abelian 
group F. Then F"/ K is a divisible group and F/K = M is a subgroup. oO 


An immediate consequence of this and Proposition 3.16 is the 
COROLLARY. Any -module can be imbedded in an injective 2—module. 


We now consider an arbitrary R-module M. We have the isomorphism of M onto hom,(R, 


M) which maps an element x e M into the homomorphism f, such that 1 ~ x. This is an R- 
isomorphism if we make hom ,(R, M) into a right R-module as in Proposition 3.4 by defining 
fa,a € R, by (fa)(b) = fab). Also homz (R, M) is a right R-module using this definition of fa. 
Clearly, homp(R, M) is a submodule of homa (R, M). Since M is isomorphic to hom,(R, M), 


we have an imbedding of M in homa (R, M). Now imbed M in an injective 2-module Q, which 
can be done by the foregoing corollary. Then we have an imbedding of homa (R, M) into homz 
(R, Q) and hence of M in homa (R, Q) as R-modules. Now this gives an imbedding of M in an 
injective R-module, since we have the following 


LEMMA 2. Jf Q is a @-injective 2-module, then homa (R, Q) is an injective R-module. 

Proof: We have to show that if 0 — N' 4 Nis an exact sequence of R-modules, then 

(79) hom,(N, hom;(R,Q))  hom,(N’,hom;(R,Q)) > 0 

is exact where /* = homp(f, homz (R, Q)). By Proposition 3.8 we have an isomorphism 
~y:hom7(N@xR,Q) > hom,(N, hom;(R, Q)) 


and the definition shows that this is natural in N. Since the isomorphism of N ®@p R onto N such 
that y ®© 1 ~ y is natural in N, we have an isomorphism 


Wy:hom,(N,Q) > hom,(N, hom,(R, Q)) 


which is natural in N, that is, we have the commutativity of 


hom,(N,Q) Yy hom,(N,hom,(R,Q)) 
f f 
hom,{N’,Q) Yy. hom,(N’,hom,(R,Q)) 


where J = hom (f, Q). Now Å is surjective since @ is Z-injective. Since wy and yy, are 
isomorphisms, this implies that /* is surjective. Thus (79) is exact. O 


The foregoing lemma completes the proof of the imbedding theorem. 
THEOREM 3.18. Any module can be imbedded in an injective module. 


The proof we have given is due to B. Eckmann and A. Schöpf. 


We can apply the theorem to complete the following characterization of injectives, which 
we indicated earlier. 


PROPOSITION 3.17. A module Q is injective if and only if every short exact sequence 0) 
+O > M N —> 0 splits. Equivalently, O is injective if and only if it is a direct summand 
of every module containing it as a submodule. 


Proof. We have seen that if Q is injective then every exact sequence 0 — Q > M > N— 0 
splits (p. 156). Conversely, suppose Q has this property. By the imbedding theorem we have an 
exact sequence 0 — Q > M where M is injective. Then we have the short exact sequence 0 > 
OQO + M % M/ Q — 0 where p is the canonical homomorphism of M onto M / Q. By 
hypothesis, we can find a p’: M — Q such that p'i = 19. Now suppose we have a diagram 


N’ j 
ee 


Q 


Since M is injective, we can enlarge this to a commutative diagram 


This means that by the injectivity of M we have g: N — M such that if = gj. Then f= lof = p'if 
= (p'g)j. Hence Q is injective. O 

The imbedding theorem of modules in injective modules has an important refinement, 
namely, there exists such an imbedding, which is in a sense minimal, and any two minimal 
injective extensions are isomorphic. The key concepts for developing these results are those of 
a large submodule of a module and of an essential monomorphism. A submodule M of a 
module N is called large in N if N' N M £ 0 for every submodule MW’ # 0 of N. A 
monomorphism i: M — N is called essential if i(M) is a large submodule of N. In this case, N 
is called an essential extension of M. 

It is clear that if M is a large submodule of N and N is a large submodule of Q, then M is 
large in Q. Let M be any submodule of a module N and let E = {S} be the collection of 
submodules of N containing M as large submodule. Evidently, E # Ø since M e E. Moreover, 
if {S,,} is a totally ordered subset of E, then S' = US, is a submodule and if N’ # 0 is a 


submodule of S’, then M' N S, # 0 for some S,. Since M is large in S4, (VN S) N M #0; 
hence, N' N M #0. Thus M is large in S = (US, It follows from Zorn’s lemma that E contains 


a maximal element S, that is, we have a submodule S of N containing M as a large submodule 
and such that for any submodule T F S, M is not large in T. 
We can give a characterization of injective modules in terms of essential homomorphisms: 


PROPOSITION 3.18. 4 module Q is injective if and only if every essential monomorphism 
of Q is an isomorphism. 


Proof. Suppose Q is injective and let Q > M be an essential monomorphism of Q. We have 
an exact sequence 0 — Q + M — M/ Q — 0 that splits by Proposition 3.17. Then M = i(Q) 
® Q' where Q' is a submodule. Since Q' N i(Q) = 0 and i is essential, O' = 0. Hence M = i(Q) 
and i is an isomorphism. Conversely, suppose Q is a module such that every essential 
monomorphism of Q is an isomorphism and let 0 —> Q > M % N — 0 be exact. By Zorn’s 
lemma, there exists a submodule S of M such that S N i(Q) = 0 and S is maximal among the 
submodules of M having this property. Consider the module M / S. Since S N i(Q) = 0, we 
have the monomorphism j: x + i(x) + S of Q into M / S whose image is (i(Q) + S)/ S. If T/ S, 
T > S, is a submodule of M / S such that T/S N (i(Q) + S) / S = S/S, then T N (i(Q) + $) = S. 
Hence T N i(Q) c S N i(Q) = 0. Then T= S by the maximality of S. This shows that j is an 
essential monomorphism. Hence this is an isomorphism. Then M / S = (i(Q) + S) / S and M = 
i(Q) + S. Since S N i(Q) = 0 we have M = S @ i(Q), which implies that 0 —> O— M+ N— 0 
splits and proves that Q is injective by Proposition 3.17. O 


The main result on imbedding of modules in injective modules is 


THEOREM 3.19. Jf M is a module, there exists an essential monomorphism i of M into an 
injective module Q. If i and i' are essential monomorphisms of M into injective modules Q 
and Q' respectively, then there exists an isomorphism |: Q > Q' such that i' = li. 


For the proof, we require the following 


LEMMA. Suppose we have a diagram of monomorphisms 


Qo 
such that k is essential and Qo is injective. Then there exists a monomorphism |: N — Qo to 


make a commutative triangle. 


Proof. Since Qù is injective, we have a homomorphism /: N — Qp to make a commutative 
triangle. Let y € ker] N im k. Then y = kx, x € Q, and 0 = ly = [kx = jx. Then x = 0 and y = 0. 
Thus ker / N im k = 0. Since k is essential, we have ker /=0. Hence /is a monomorphism. oO 


We can now give the 


Proof of Theorem 3.19. Let ig : M — Qg be a monomorphism of M into an injective module 
Qo. The existence of ig and Qg were proved in Theorem 3.18. We showed also that there exists 
a submodule Q of Qg in which ip(M) is large and that is maximal for this property. Let i denote 
the monomorphism of M into Q obtained by restricting the codomain of ig to Q. Let j denote the 
injection of Q in Qù and let k be an essential monomorphism of Q into a module N. Then we 


have the diagram and conditions of the preceding lemma. Hence we have a monomorphism / : 
N — Qj such that /k = 7. Then /(N) is a submodule of Qg containing /(A(Q)) =j(Q) = Q. Since k 
is essential, k(Q) is large in N and since / is a monomorphism, Q = /(k(Q)) is large in /(N). 
Since i(M) is large in Q, i(M) is large in (N) > Q. It follows from the maximality of Q that 
I(N) = Q. As is readily seen, this implies that k(Q) = N and so k is an isomorphism. We have 
therefore shown that any essential monomorphism k of Q is an isomorphism. Then Q is 
injective by Proposition 3.18. Now let i’ be any essential monomorphism of M into an injective 
module Q’. Applying the lemma to 


we obtain the monomorphism / : Q — Q' such that li = i’. Since /(Q) is an injective submodule 
of QO’ we have a submodule QO” of QO’ such that QO’ = 1(Q) @ Q". Since 1(Q) > i'M) = (i(M)) 
and /(Q) N Q" =0, i'(M) N Q" = 0. Since 7’ is an essential monomorphism, i'(M) is large in QO’. 
Hence QO” = 0. Then Q’ = /(Q) and / is an isomorphism of Q onto Q' such that /i=i'. o 


An injective module Q such that there exists an essential monomorphism i of M into Q is 
called an injective hull (or injective envelope) of M. Its existence and uniqueness is given in 
Theorem 3.19. The argument shows that if O' is any injective module and i’ is a monomorphism 
of M into Q, then there exists a monomorphism / of Q into Q' such that i’ = li. In this sense an 
injective hull provides a minimal imbedding of M into an injective module. 


EXERCISES 


1. Let D be a domain (commutative ring with no zero divisors + 0), F the field of fractions 
of D(F > D). Show that if M is a D-module, then Mp = F ®pM is a divisible D-module 


and x ~ 1 ®x is an essential monomorphism of M into Mp if M is torsion-free. 
2. Show that [[Q,, injective if and only if every Q, is injective. 


3. Show that a submodule M is large in the module N if and only if for any x # 0 in N there 
exists anr € R such that xr #0 and xr € M. 


3.12 MORITA CONTEXTS 


The remainder of this chapter is devoted to the Morita theory of equivalence of categories of 
modules. The principal questions considered in this theory are when are two categories mod-R 
and mod-S equivalent, how are such equivalences realized, and what are the auto-equivalences 
of mod-R? We showed in Chapter 1, pp. 29-31, that for any ring R and any positive integer n, 
mod-R and mod-M,„(R) are equivalent categories. We shall see (section 3.14) that this is 
almost the most general situation in which equivalence of mod-R and mod-S occurs. From the 
point of view of the applications, the machinery used to develop the results on equivalence of 
modules is more important than the results themselves. The central concept of this machinery is 
that of a Morita context (or “set of pre-equivalence data”). We shall begin with the Morita 
context, considering first an example. 

We consider a foursome consisting of 1) a ring R, 2) a right R-module M, 3) the dual M* = 
hom(M, R), and 4) R' = End M. We have seen that we can regard M in a natural way as an R’- 
R-bimodule (p. 133). Here, if 7’ € R' and x € M, r'x is the image of x under r’. We have also 
seen that M* becomes an R-R’-bimodule if we define 


(80) (ry*)x =r(y*x),  (y*r)x = y*(r'x) 


forre R,r'e R,y* € M*,x €e M (p. 135). It is a good idea to treat M and M* in a symmetric 
fashion. To facilitate this, we write (y*, x) for y*x = y*(x) and we consider the map of M* x M 
sending the pair (y*, x) into the element (y*, x) € R. We list the properties of this map: 


(y*, xX, +X2) = (y*,x,)+()*,X2) 
(y*, xr) = (y*, x)r 


(31) (yt +y3,X) = (yf,x) + (yz, x) 
(ry*, x) = r(y*, x) 
(y*r’, x) = (y*,r°x). 


The first two of these amount to the definition of a homomorphism from Mp to Rp and the third 


to the definition of the sum of two homomorphisms; the last two are the definitions (80). The 
first, third, and fifth of (81) show that (y*, x) + (v*, x) is an R’-balanced product of M* and M. 
Hence we have a homomorphism t of M*®p',M into R such that t(y*®x) = (y*, x). The second 


and fourth equations now read t(y*@xr) = (t(y*@x))r and t(ry*®x) = rt(y*@x). Hence if we 
regard M* = pM*,' and .M = p'Mp and, consequently, M*®p'M as an R-R-bimodule, then z is a 
bimodule homomorphism of M*®p'M into R. 

We define next a bimodule homomorphism u of M®@pM* into R'. Let (x, y*) be an element of 
M x M*. Then this defines a map 


(82) [x,y*]:y > x(y*,y) 


of M into M. Evidently, [x, y*](v) + y2) = Lx, y*]y, + [x, y*]y2 and [x, y*](vr) = (Lx, y*]y)r so 
[x, y*] € R'= End M. Moreover, we have 


[xi +x2,9*] = Lx y*] +P, 9") 
[x yf ty?) = Dot) +[x, y2] 
(83) [xr,y*] = [x,ry*] 
iray e= riy] 
wY eia hr, 


which follow directly from the definition (82) and from (81). It follows from (83) that we have 
an R'-R'-bimodule homomorphism u of M®,pM* into R’ such that u(x@y*) = [x, y*]. The 
definition (82) can be rewritten as 


(84) [x, y*]y = x(y*,y) 
and we have 

(85) x x,y" |] =e (x* x)", 
since 


(x*Lx,y*])y = x*(Lx, y*]y) = x*(x(y*, y)) = (x*x) (y*, y) = (x*, x) (y*, y) 
and 
((x*, x)y*)y = (x*, x) (y*y) = (x*,x) (y*,y). 


We shall now abstract the essential elements from this situation by formulating the 
following definition. 


DEFINITION 3.11. 4 Morita context is a set (R, R', M, M', t, u) where R and R' are rings, 
M = p'Mp is an R'-R-bimodule, M' = pM'g' is an R-R'-bimodule, t is an R-R-homomorphism 
of M'®;'M into R, and u is an R'-R'-homomorphism of M ®pM' into R' such that if we put t(x' 
® x) = (x', x) and u(x @ x’) = [x, x'], then 

(i) [x, yy = x0"). 

(it) x'Lxy'] = @', xy". 


We remark that conditions (i) and (ii) are equivalent respectively to the commutativity of 
the following diagrams: 


M Or M' Op M 1@r M@,R 


M @,. M' lOp M’ @, R’ 


R @, M’ M’ 


Here the unmarked arrows are the canonical isomorphisms of M®,R with M, of R'®ẹ. M with 
M, etc. 
The foregoing considerations amount to the definition of a Morita context (R, R' = End Mp, 


M, M*, T, 4) from a given right R-module M. We shall call this the Morita context defined by 
Mp. We now give a simpler 


EXAMPLE 


Consider the free left module “R and the free right module R™ as on p. 126. Put M = R™®, M' 
= R and denote the elements of M as columns 


Xi 


and those of M’ as rows x’ = (xX, X2, ..., x,). Let R' = M,(R), the ring of n x n matrices with 
entries in R. Then M is a left R’-module if we define r'x for r' € M,(R) to be the matrix 
product. Then M is an R'-R-bimodule. Similarly, M' is an R-R'-bimodule if we defined the 
action of r’ on x’ as the matrix product x'r'. Also we define [x,y] = xy’, the matrix product, as 
on p. 126. Then we have the homomorphism u of M®pM' into R' = M,(R) such that u(x @ y’) = 
[x, y'], and this is an R'-R’'-bimodule homomorphism. We define (y', x) = y'x. This is a 1 x 1 
matrix, so it may be regarded as an element of R. Then we have an R-R-bimodule 
homomorphism t of M' @p' M into R such that t’ ® x) = (y', x). The relations (i) and (ii) 
follow from the associative law of multiplication of matrices. Hence (R, R' = M,(R), R™, R, 
T, u) is a Morita context. 
IfM is a right R-module and M* = hom(M, R), then we put 


(86) T(M) = p vr cR. 


Evidently this is the subgroup of the additive group R generated by the values (y*, x) = y*(x), 
y* e M*, x€ M. T(M), which is called the trace of the module M, is an ideal in R, since ifr € 
R then 


(y*,x)r = (y*,xr)e T(M) 
r(y*, x) = (ry*,x)e T(M). 


We are now ready to prove the first main theorem on Morita contexts. This draws a large 
number of important conclusions from the hypotheses that t and u are surjective. The results 
are implicit in K. Morita’s basic paper on duality for modules. Subsequently the formulation 
was improved substantially by H. Bass and others, who called the result “Morita I.” To shorten 
the formulation we shall now call a module a progenerator if it is finitely generated projective 
and its trace ideal is the whole ring. The appropriateness of this terminology will appear in 
section 3.13. 


MORITA I. Let (R, R', M, M', t, u) be a Morita context in which t and u are surjective. 
Then 


(1) Mp, pM, M'g, pM’ are progenerators. 

(2) t and u are isomorphisms. 

(3) The map L: x' ~ I(x") where I(x') is the map y ~ (x', y) of M into R is a bimodule 
isomorphism of pM'g: onto pM *¥g: where M* = hom (Mp, Rp). We have similar 
bimodule isomorphisms of pM'p with hom (pM, pR'), of pMp with hom (pM’, RR), 
and of pMp with hom(M'p, R'p'). 

(4) The map i: r' ~ Ar’) where A(r’) is the map y ~ r'y is a ring isomorphism of R' 
onto End Mp = hom(Mp, Mp). Similarly, p : r ~ p(r) where p(r) is y ~ yr is a ring 
anti-isomorphism of R onto End pM and we have a similar isomorphism i' of R 
onto End M'g', and antiisomorphism p' of R' onto EndpM'. 

(5) The pair of functors @pM' and @pM define an equivalence of the categories mod- 
R and mod-R'. Similarly, M @p and M' ®p define an equivalence of R-mod and R 
'-mod. 

(6) IfI is a right ideal of R put 


(87) n(I) = IM' = {Yb,xi|b,e 1,x;€ M’} 


and if N' is a submodule of M'g put 


(88) &(N’) = (N', M) = {S(yi.x)) | ye N', x€ M}. 


Then y and €, are inverses and are lattice isomorphisms between the lattice of right ideals 
of R and the lattice of submodules of M'g. Moreover, these induce lattice isomorphisms 


between the lattice of (two-sided) ideals of R and the lattice of submodules of pM'p. Similar 


statements hold for the lattices of left ideals of R, of left ideals of R', and of right ideals of R 
" These imply that R and R' have isomorphic lattices of ideals. 


(7) The centers of R and R' are isomorphic. 


Proof. (1) Since u is surjective, we have u; € M, v’;€ M', 1 <i < m, such that ru.) = 1 (the 
unit of R’). Then if x e M, XEM, x = lx = Ffu, vi]x = Lue), Now the map I(v'): x +(v';, x) is in 
hom (Mp, R). Hence it follows from the dual basis lemma (Proposition 3.11, p. 152) that Mp is 
finitely generated projective. The hypothesis that t is surjective means that any r € R can be 
written as "= Lx MEM. xEeEM. Then "= LOD GET) and T(M) = R. Thus Mp is a 
progenerator. The other three statements follow by symmetry. 

(2) Suppose L*@yekert, so E(x») =, Since t is surjective, we have W, € M,z'; € M' 
such that Xw) = 1 (in R). Then 


EXO: =) HOvlzj,w) =) xel zw; 
I,J I,J 
= ¥ xi[y,z)]Ow; = } (x), y)zj@w, = 0. 
ij ij 


Thus ker t = 0 and 7 is an isomorphism. Similarly, u is an isomorphism. 

(3) We have noted that /(y’):x ~ (y', x) is in hom (Mp, R). Direct verification, using the 
definition of the actions of R and R’ in hom (Mp, R), shows that y’ ~ /(y’) is an R-R'-bimodule 
homomorphism of M' into hom (Mp, R). Suppose /(y’) = 0, so (9, x) = 0 for all x. As before, 
we can write |= Elsti] weM, 6EM Then y = Ly’ Lue] = Lowe =0, Hence 1: y' ~ 10 is 
injective. Now let x* € hom(Mp, R) and put Y = Lee: where Elsti] = 1, Then 


2 ((x*uj)u;, x) = ny (x*u;)(v;, x) 
x*(Yufv;,x)) = x*(Y[u;, vj ]x) = x*(x). 


Hence x* = /(y’) and / is surjective. The other three assertions follow by symmetry. 

(4) Since M is an R’-R-bimodule, A(r’):x ~ r'x is an endomorphism of Mp. It is clear that 2 
is a ring homomorphism of R' into End Mp. Now suppose /(r’) = 0. Then r'x = 0 for all x € M; 
hence, " ="! = rfu] where Ll“) =! (in R), and "LL ei) = Lr Lu. e] = E[u] = 0, Thus r 
'=0 and so 4 is injective. Now let fe End Mp. Then ” = LL 4.0 R’ and 


(yx) 


r'x = F| fu, v;|x = Y fue; x) = fF uv, x) 
= f(E [u;, v;]x) = f(x). 


Hence f= A(r’) so A is surjective. Thus 4 is an isomorphism of rings. The other cases follow in 
a similar fashion. 

(5) If N is a right R-module, then N @p M' is a right R'-module and if N’ is a right R’- 
module, then NV’ @p, M is a right R-module. Hence we obtain the functors ®@pM' and ®@p M from 
mod-R to mod-R’ and from mod-R’ to mod-R respectively. Iteration gives the functors ®p M' ® 
pr M and ®ẹp M @p M' from mod-R to itself and from mod-R’ to itself. Now we have the 
associativity isomorphism of (NV ®@p M')@p M with N @,(M' g M). Following this with the 
isomorphism t of M' @p, M onto R we obtain an isomorphism of N @p (M' g M) onto N @p R. 
Applying the canonical isomorphism of N @p R to N and combining all of these we obtain a 
right R-module isomorphism 


(89) (N@pM')@pM >N. 


Since all of the intermediate isomorphisms that we defined are natural in N, (89) is natural in 
N. Hence ®p M' ®p M is naturally isomorphic to the identity functor 'mod-R’. Similarly, ®p M 
®p M' is naturally isomorphic to ‘mod-R’. Hence ®@p M', @p M provide a natural equivalence 
between the categories mod-R and mod-R’. The other equivalence asserted in (5) follows by 
symmetry. 

(6) It is clear from the definition that if Z is a right ideal of R, then 7(/) is a submodule of M 
'g. Moreover, if I is an ideal, then 7(/) is a submodule of pM'p. Similarly, if N’ is a submodule 
of M'p, then ((N’) is a right ideal of R and if N’ is a submodule of pM'p, then ¢(N’) is an ideal 
of R. If J is a right ideal of R, then ¢(7(/)) is the set of sums of elements of the form (by’, x) = 
b(y', x) where b € I, y'e M',x € M. Since Tt is surjective, this set is 7. Thus, ¢7 is the identity 
map on the lattice of right ideals. If N’ is a submodule of pM'p then y(n(N')) is the set of sums 
of elements of the form (y’ ,x)x', y' € M,x €e M, x’ e M'. Since (y' ,x)x' = y' [x, x'] and u is 
surjective, we have 7(C(N’)) = N'. It is clear also that y and ¢ (are order-preserving for the 
order given by inclusion. Hence these are isomorphisms between the lattices of right ideals of 
R and of submodules of M'ẹ (see BAI, p. 460). Evidently, these isomorphisms induce 
isomorphisms between the lattices of ideals of R and of submodules of pM'p. The other 
statements follow by symmetry. In particular, we have a lattice isomorphism 
l ~œ MT = {Xxibilbie l, xi€ M't of the lattice of ideals 7' of R' with the lattice of submodules of the 
bimodule pM’. Combining isomorphisms, we see that given any ideal Z of R there is a unique 
ideal /' of R' such that JM' = M'l' and I ~ T' is an isomorphism of the lattice of ideals of R onto 
the lattice of ideals of R’. 

(7) To establish an isomorphism between the center C(R) and the center C(R’) we consider 
the two rings of endomorphisms End Mp and End pM. Both of these are subrings of End M, the 


ring of endomorphisms of M regarded just as an abelian group. Now each of the rings End Mp, 
End pM is the centralizer of the other in the ring End M: By (4), End Mp is the set of maps y ~ 
r'y, r' € R', and End pM is the set of maps y ~ yr. On the other hand, the definition of End Mp 


shows that this subring of End M is the centralizer of the subring consisting of the maps y ~ yr. 
Similarly, End pM is the centralizer in End M of the set of maps y ~ r'y. Hence each of End 


Mr; End pM is the centralizer of the other in End M. Clearly this implies that the center 


Since we have an isomorphism of End Mp with R’ and an anti-isomorphism of End pM with R, 
this gives an isomorphism of C(R) with C(R’). o 


We shall now show that Morita I is applicable to any Morita context (R, R' = End Pp, P, P* 
= hom (Pp, R), t, u) determined by a progenerator P = Pp. Here t and u are as defined at the 
beginning of the section. The hypothesis that P is a progenerator includes the condition 7(P) = 
R and this means that t is surjective. Since P is finitely generated projective, by the dual basis 
lemma, we have x; e P, x;* e P*, 1 <i <m, such that x= Lxix?*) for any x in P. Then 
x = Exdxž. x = Efx xt] x = (Sixx x. Thus ELX] = lr, which is the unit element of R' = End 
Pp. It follows that any r’ € End Pp has the form " = 71e = E[rx.x*] and so u is surjective on R’. 
We therefore have 


THEOREM 3.20. Jf P is a progenerator in modR, then u and t are surjective for the 
Morita context (R, R' = End Pp, P, P* = hom(Pp, R), t, u). Hence Morita I is applicable. 


EXERCISES 


1. Show that Morita I is applicable in the example given on pp. 166—167. Interpret the 
results of the theorem in this case as theorems on matrices. In particular, use the 
correspondence between the lattice of ideals of R and of R’ given in the proof of Morita I 
(6) to show that the map B ~ M,(B) is an isomorphism of the lattice of ideals of R onto 


the lattice of ideals of M,(R). (This was exercise 8, p. 103 of BAI.) 


R M ax 
2. Let (R, R' ,M, M' t, u) be a Morita context. Let (wR be the set of matrices p v) where 
ae R, b'e R', x'e M'. Define additional component-wise (usual matrix addition) and 
multiplication by 


(" “ll "i a+ (Xi Y2) ax} +455 
yı bi/\y2 b3, Vid, +biya [yx] +bb,/ 


Verify that this addition and multiplication together with the obvious 0 and 1 constitute a 


ring. We shall call this the ring of the Morita context (R, R' ,M, M' t, u). Let B be an 


i B BM'+M'B 
ideal in R, B' an ideal in R’. Verify that the set of matrices (mime e O) is an ideal in 
the foregoing ring. 


3. Consider the special case of exercise 2 in which M' = 0, t = 0, u = 0. Show that the 
resulting ring is isomorphic to the ring defined as in exercise 1, p. 136, regarding M as a 
T-T-bimodule relative to T= R x R' as in exercise 2, p. 136. 


3.13 THE WEDDERBURN-ARTIN THEOREM FOR SIMPLE RINGS 


We shall now apply Morita I to derive a classical structure theorem for simple rings—the 
Wedderburn-Artin theorem. This was proved for finite dimensional algebras over a field by 
Wedderburn in 1908 and for rings with descending chain condition on one-sided ideals by 
Artin in 1928. In the next chapter we shall integrate this theorem into the general structure 
theory of rings. Here we treat it somewhat in isolation and we formulate the result in the 
following way. 


WEDDERBURN-ARTIN THEOREM FOR SIMPLE RINGS. The following conditions 
on a ring R are equivalent: 


(1) R is simple and contains a minimal right (left) ideal. 


(2) R is isomorphic to the ring of linear transformations of a finite dimensional vector 
space over a division ring. 


(3) R is simple, left and right artinian, and left and right noetherian. 


Proof. (1) = (2). Let J be a minimal right ideal of R, so J = Jp is an irreducible right R- 
module. We show first that Jp is finitely generated projective. Consider £! = Ler“! This is an 
ideal in R containing 7. Hence RI = R. Then ! = }"ab: where a; € R, b; € I. This implies that 
R = "a, Observe next that since for a €e R, the map x ~ ax, x € J, is a homomorphism of 7 as 
right module, by Schur’s lemma either al = 0 or al = I as right module. Then R = R = La! is ê 
sum of irreducible right modules. Then by Thoerem 3.9 (p. 117), R =7@® T where /' is a second 
right ideal. This implies that 7 is projective and can be generated by a single element. By the 
dual basis lemma, the trace ideal 7(/) # 0. Since R is simple, we have T(J) = R and J is a 
progenerator. Hence Morita I is applicable to the Morita context (R,R’ = End Jp, J, /* = hom 


(Ir, R), t, u). By Schur’s lemma, R’ is a division ring. By Morita J, /* is a finitely generated 
right module over the division ring R’ and R is isomorphic to End /*». If we replace R’ in the 
usual way by its opposite A = R’°P, then /* becomes a finite dimensional (left) vector space 


over A and R is isomorphic to the ring of linear transformations of this vector space. The proof 
for J, a minimal left ideal, is similar. 


(2) = (3). Let V be a finite dimensional vector space over a division ring A and let L be its 


ring of linear transformations. Put R = A” and regard V as right module over R. Then Vp is free 


with finite base so it is finitely generated projective. Moreover, if x # 0 in V, then there is a 
linear function f such that f(x) = 1. This implies that the trace ideal 7(Vp) = R and so Vp is a 
progenerator. Then Morita I applies to (R, L = End Vp = End, V, V, V= = hom (Vp, R), T, y). 
Then Morita I (6) shows that Z and the division ring R have isomorphic lattices of ideals. 
Hence L is simple. Also this result shows that the lattice of left ideals of L is isomorphic to the 
lattice of submodules of V* and the lattice of right ideals is isomorphic to the lattice of 
submodules of V. Since finite dimensional vector spaces satisfy the descending and ascending 
chain conditions for subspaces, it follows that L is left and right artinian and noetherian. 
(3) + (1) isclear. O 


— 


EXERCISES 


. Use Morita I (6) to show that if L is the ring of linear transformations of a finite 


dimensional vector space V over a division ring, then L acts irreducibly on V (that is, V 
regarded in the natural way as left Z-module is irreducible). 


. Let V be a finite dimensional vector space over a division ring A, V* the right vector 


space of linear functions on V. Ifx e V and fe V*, write &f) =f). If U(U*) is a 
subspace of V(V*), put 


U+ = {feV*|<y,f> =0,yeU} (U* = {ye V|<y,g) = 0,geU*}). 


Show that the maps U~U’,U*~+U* are inverses and hence are anti isomorphisms 
between the lattices of subspaces of V and V*. 


. Let V and L be as in exercise 1. Use Morita I (6) and exercise 2 to show that the map 


a:U ~a(U) = {le L|(U) = 0! 


is an anti-isomorphism of the lattice of subspaces of V onto the lattice of left ideals of L. 


. Show that the map 


I~ann,J = {leL|lI = 0) 


is an anti-isomorphism of the lattice of right ideals of L onto the lattice of left ideals of 
L. 


. Show that any left (right) ideal of L is a principal left (right) ideal generated by an 


idempotent (that is, has the form Le, e? = e or eL, e? =e). 


. Show that if 7 e L, then there exists a u e L such that lul = l. 


3.14 GENERATORS AND PROGENERATORS 


If R is a ring and M is a right R-module, any x e M is contained in the submodule xR. Hence 
M => su XR and xR is a homomorphic image of R = Rp. We shall now call a right R-module X a 
generator of the category mod-R if any module M is a sum of submodules all of which are 
homomorphic images of X. Thus R is a generator for mod-R. Evidently, if X is a generator and 
X is a homomorphic image of Y, then Yis a generator. Since a homomorphic image of X” is a 
sum of homomorphic images of X, it is clear also that if XY”) is a generator, then X is a 
generator. The concept of generator will play a central role in the study of equivalences 
between categories of modules. The following theorem gives a number of important 
characterizations of generators. 


THEOREM 3.21. The following conditions on a module X are equivalent: 


(1) Xis a generator. 
(2) The functor hom(X, —) is faithful (see p. 22). 
(3) The trace ideal T(X) = R. 


(4) There exists ann such that R is a homomorphic image of X”. 


Proof. (1) = (2). We have to show that for any two right R-modules M and N, the map f ~ 
hom(X, f) of hom (Mp, Np) into the set of homomorphisms of hom(X, M) into hom (X, J) is 
injective. Here hom (X f) is the map g ~ fg of hom (X, M) into hom (X, N). Since hom LX, f) is 
a homomorphism, it suffices to show that if f # 0 then the map hom LX, f) of hom (X, M) into 
hom (X, N) is # 0. This means that we have to show that for a given f # 0, fM — N, there 
exists a g € hom (X, M) such that fg + 0. Suppose this is not the case. Then fg = 0 for every g 
e hom (X, M). Since X is a generator M = X} gX where the summation is taken over all 
homomorphisms g of X into M. Then JM =/(L9X) = LigX = 9, contrary to fF 0. 

(2) = (3). Let M = R, N= R/ T(X), and let v be the canonical homomorphism of M into N. 
Given any g € X* = hom (X, M), then g(x) € T(X) for every x e X. Hence vg(x) = 0. Thus vg = 
0 for every g e hom (X, M). By hypothesis, this implies v = 0. Then N = 0 and R = 7(X). 

(3) = (4). If 3 holds, we have ! = Efix; for x€ X, feX*, Now consider X” and the map 
(Visies Ym) > ETSY Of X of X into R. This is a homomorphism of right R-modules, so its 
image is a right ideal of R. Since ! = L/ is in the image, the image is all of R. Thus we have 
an epimorphism of X) onto R. 


(4) = (1). Since R is a generator and R is a homomorphic image of X”, it follows that X0” 
is a generator. Then X is a generator. O 


The characterization 3 of generators shows that a module X is a progenerator in the sense 
defined in section 3.12 if and only if X is finitely generated projective and X is a generator of 
mod-R. 


We shall call a module M faithful if the only a such that Ma = 0 is a = 0. For any module 
M we define annpV = {b e R | Mb = 0}. It is clear that this is an ideal in R and that M is 


faithful if and only if annpM = 0. It is clear also that R regarded as the module Rp is faithful 
and it follows from Theorem 3.21.4 that any generator X of mod-R is faithful. For, if R is a 
homomorphic image of some X”) then annpX¥ c annpR = 0 so X is faithful. 

In the important special case in which the ring R is commutative, condition(3) of Theorem 


3.21 can be replaced by the simpler condition that X is faithful. To prove this we require the 
following 


LEMMA. Let R be a commutative ring, M a finitely generated R-module. An ideal I of R 
satisfies MI = M if and only if I + annpM =R. 


Proof. (1) Suppose J + annpM = R. Then 1 = b + c where b € I, c e€ anngM. Any x € M can 
be written as x = xl = xb + xc = xb e MI. Thus MI = M. (Note that finite generation is not 
needed for this part.) (2) Let x), ...,x, generate M. The condition MI = M implies that any x € 
M has the form Li%Ps bel. In particular, ¥i = Lj=1Xjbj 1 Si <n, or Vj= xX (ð; — bi) = 0, 
These equations imply xjdet(! — B)=0, 1<j<n, B= (b;), Evidently c = det(1 —B) = 1 — 
b, b e I. Since R is commutative, xe = 0, 1 <j <n, implies that xc = 0 for all x € M so c € 
anngM. Then / = b + c e I + annęgM and hence R = J + anngM. O 


We can now prove 


THEOREM 3.22. Any faithful finitely generated projective module over a commutative 
ring is a generator (hence a progenerator). 


Proof. IfM is finitely generated projective, by the dual basis lemma, we have elements x, ..., 
x, E M and elements fi ...,f, € homp(M, R) such that * = Lx.) Then f(x) € (M), so this 
shows that M = MT(M). Hence, by the lemma, R = anngM + T(M) and so if M is faithful, then 
R = T(M). Hence M is a generator by Theorem 3.21. O 


In considering equivalences between mod-R and mod-R’ for a second ring R' we assume, of 
course, that the pair of functors (F, G) defining the equivalence are additive. We recall that F 
is faithful and full (Proposition 1.3, p. 27). Hence for any R-modules M and N, the map F of 
homg(M,N) into homp (F'M,FN) is an isomorphism. Moreover, F respects the composition of 


homomorphisms. It follows that properties of an R-module or an R-homomorphism that can be 
expressed in categorical terms carry over from mod-R to the equivalent mod- R’. For example, 
f:M — N is injective (surjective) if and only if/is monic (epic) in mod-R. Hence f is injective 
(surjective) if and only if Ff) : FM — FN is injective (surjective). 

The concept of a subobject of an object in a category (p. 18) provides a categorical way of 
dealing with the submodules of a given module N. In mod-R, a subobject of the module N is an 
equivalence class [f] of monies f: M — N where the equivalence relation is defined by f ~f : 


M' — N if there exists an isomorphism g: M' — M such that f’ =fg. In this case, f M' = fM, so 
all of the f in [f] have the same image in M and this is a submodule. Moreover, if M is any 
submodule, then we have the injection i :M — N, which is a monic, and iM = M. Thus we 
have a bijection of the set of submodules of N with the set of subobjects of N. This is order- 
preserving if submodules are ordered in the usual way by inclusion, and we define [f] < [/] for 
subobjects of N to mean f' = fg for a monomorphism g. If {N,} is a directed set of submodules 
of N, then U, is a submodule and this is a sup for the set of N, in the partial ordering by 
inclusion. It follows that any directed set of subobjects of N has a sup in the ordering of 
subobjects. If (F, G) is an equivalence of mod-R to mod-R’ and {[f,]} is a directed set of 
subobjects of N with sup [f], then it is clear that {[F(f)]} is a directed set of subobjects of FN 
with sup {[F,]}. A subobject [f] is called proper if fis not an isomorphism or, equivalently, if 
fM for f:M — N is a proper submodule of N. If [f] is proper then [F(/)] is proper. 

We wish to show that if X is a progenerator of mod-R, then FX is a progenerator of mod-R’. 
This will follow from 


PROPOSITION 3.19. Let (F, G) be a pair of functors giving an equivalence of mod-R and 
mod-R’. Then (1) If X is a generator of mod-R then FX is a generator of mod-R'. (2) If X is 
projective then FX is projective. (3) If X is finitely generated then FX is finitely generated. 


Proof. (1) By Theorem 3.21.2, X is a generator if and only if for f:M — N, f + 0, there exists 
a g:X — M such that fg + 0. Now consider FX and let f:M' — N’, f + 0, in mod-R’. Then G(/’) 
+ 0 so there exists a g-X GM' such that G(f’)g + 0. Then FG(/’)F(g) + 0. The fact that FG = 
lmoq-g' implies the existence of a g’: FX — M' such that f'g' + 0 (draw a diagram). Hence FX 
is a generator in mod-R’. 

(2) Since surjectivity of a homomorphism f is equivalent to the condition that f is epic in 
mod-R, the statement that X is projective is equivalent to the following: given an epic p.M —> 
N, the map g ~ pg of homp(X, M) into homp(X, N) is surjective. From this it follows easily 
that X projective implies FX is projective. We leave it to the reader to carry out the details of 
the proof. 

(3) We note first that X is finitely generated if and only if it cannot be expressed as a union 
of a directed set of proper submodules (exercise 3, p. 60). The categorical form of this 
condition is that the sup of any directed set of proper subobjects of X is a proper subobject. 
Because of the symmetry of equivalence, it suffices to show that if {[/,]} is a directed set of 
proper subobjects of an R-module N such that [f] = sup {[/,]} is not proper, then the same thing 
holds for {[F(/,)]} as subobjects of FN. This is clear from the remark made above. 0 


EXERCISES 


1. Show that Proposition 3.19 holds for the property of injectivity of a module. 
2. Let (F, G) be as in Proposition 3.19. Show that if M is noetherian, then FM is 


noetherian. What if N is artinian? 

3. Amodule X is called a cogenerator of mod-R if for any module M there exists a 
monomorphism of M into a product []_X of copies of X. Show that X is a cogenerator if 
and only if the functor hom (—, X) is faithful. 


4. Determine a cogenerator for mod-2. 


3.15 EQUIVALENCE OF CATEGORIES OF MODULES 


Before proceeding to the next main result, which will give a condition for equivalence of the 
categories mod-R and mod-R’ for two rings R and R’ and the form of such equivalences, we 
shall establish a natural isomorphism between two functors that arise in the situation of Morita 
I. 

We recall that if P = pPp and M = Mp (hence = Z Mp) then, as in Proposition 3.4, hom (Pp, 
Mp) can be regarded as a right R’-module by defining fr’ for fe hom(Pp, Mp), r' € R' by (fr')x 
= f(r'x) (see p. 134). This defines a functor hom (Pp, —) from mod-R to mod-R’. We shall now 


show that if we have a Morita context (R, R', P, P', t, u) in which t and t and w are surjective, 
as in Morita I, then hom (Pp, —) and ® pP’ are naturally isomorphic functors. Let u e M, z' e P 


'. Then we define the map 
(90) {u,2'}:2 > u(z’, z), zeP 


of P into M. It is clear that {u, Z'} € hom (Pp, Mp) and the map (u, z') ~ {u, z'} defines a 
balanced product of Mp and pP’. Hence we have the homomorphism of M® p P' into hom (Pa, 
Mp) such that u® z ~ {u, zt. If r e R, then {u, z’}r’ is defined by 
(m2 jr ye = (ue hrs = ulz’ r'z) = uzr) Thus {w7} = {uzr} so the homomorphism such that 
u®7 > {u7} is a right R'-module homomorphism. We claim that this is an isomorphism. As in 
the proof of Morita I, we write ! = Elu» v] for ue P, ije P’. Then if fe hom (Pp, Mp) and z € P, 
we haye F =S02 = fOleLu; v;]2) =S Cukr; 2) = ES luM z) = Ti funo? Thus f=5{fi, vi}. Hence 
the homomorphism of M@pP’ into hom (Pp, Mp) is surjective. Now suppose $ {w;, w’;} = O for 
w,; € M, w3 € P'. Then 


Ywi@w; = Yw,@w,l = 2 wi@wilyj, vj] 


i,j 


= » W,@(w;, ujv; = y ww; Uj) Qu; 
Fy a 


ij 
=) {wp wiuj@v; = 0. 
ij 
Hence our homomorphism is injective. Thus we have the right R'-module isomorphism 
(91) nn:M®prP' — hom (Pr, Mp) 


such that yy(u® z') = {u, z'}. This is natural in M, that is, iff € hom (Mp, Np) then 


M ®, P’ "hs hom (P,.M,) 
fol hom (P,f) 
N®,P’ Ny hom /Pr: Ng) 


is commutative. To verify this let u ® z e M ®, P. Then hom 
(P.f)ny(u@z’) =f {u2} = {fuz} (by the definition (90)) and 


nn f®1)(u®z') = {fu,z'}. 


Hence the diagram is commutative. We have therefore shown that the functors ®pP’ and hom 
(Pp, —) are naturally isomorphic. 
We can now prove the second main result of the Morita theory. 


MORITA II. Let R and R' be rings such that the categories mod-R and mod-R’ are 
equivalent. Then 


(1) There exist bimodules g: Pp, pP'p and a Morita context (R, R', P, P' t, u) for which t 
and u are surjective so Morita l holds. In particular, R' is isomorphic to End Pp for 
a progenerator P of mod-R and R is isomorphic to End Pp, P'a progenerator of mod- 
R'. 

(2) If (F,G) is a pair of functors giving an equivalence of mod-R and mod-R', then F is 
naturally isomorphic to ®g P' and G to @p'P where P and P' are as in (1). 


Proof. (1) Let (F, G) be as in (2) and put P = GR', P = FR. Since R is a progenerator of mod- 
R, P'is a progenerator of mod-R' by Proposition 3.19. Similarly, P is a progenerator of mod-R. 
Since R’ is a ring, the map å: r' ~ A(r’) where A(r’) is the left multiplication x' ~ r' x’ is an 


isomorphism of R’ onto End R’p. Since G applied to End R = hom (R’p, , R’p,) is an 
isomorphism of this ring onto hom (Pp, Pp) = End Pp, we have the ring isomorphism 7’ ~ GA(r 
') of R' onto End Pp. It is clear that r'x = GA(r')x makes P an R'-R-bimodule. Similarly, r ~ 
FA(r) is an isomorphism of R onto End P’p, and P’ is an R-R-bimodule. Similarly, rx’ = FA(r)x' 
. We have the chain of group isomorphisms 


Pg = hom(R’, Pr) = hom(R’, FR) = 
hom (Pr, GFR) = hom (Pp, R) = P* 


where the first is the canonical one (exercise 1, p. 99), the second is G, and the third is g ~ 
Corrg Where C is the given natural isomorphism of GF to lyog-r All of the modules R’ P'p, Pp, 


R and GFR in (92) except GFR are bimodules and GFR becomes an R-R-bimodule if we 
define rx = GFA(r)x forr e R, x € GFR. With these definitions, all of the homs in (92) have R- 
R'-bimodule structures given by Proposition 3.4 (p. 134). Now it is immediate that the 
canonical isomorphism of P’p and hom (R’p3p) is an R-R'-bimodule isomorphism. The left R- 
action on hom (R', FR) is rf = FrX(r)f and the right inaction is fr' = fA(r'). Applying G we obtain 
G(rf) = GFA(r)G(f) and G(r) = G(f)GA(r'). Taking into account the R and R’ actions on hom 
(Pp, GFR) we see that the second isomorphism in (92) is an R-R' -bimodule isomorphism. 


(92) 


Similarly, if we use the naturality condition SureGFA(") = 4("Xcrm, we can show that the last 
isomorphism in (92) is R-R'. Thus we obtain an R-R’-bimodule isomorphism of pP’p with pP*p 
n If we apply this to the Morita context (R, R', P P*, — — ) determined by P, we obtain a 
Morita context (R, R', P, P’, t, u ) in which t and u are surjective. Then Morita I and the result 
noted at the beginning of the section are valid. 

(2) IfM is a right R-module, we have FM = hom (R'p, FM) = hom (GR'p, ,GFM) (by G) = 
hom (P,M) (by the natural isomorphism of GF and lnod-p) = M@pP' (as above), and all of the 
isomorphisms are natural in M. Hence the given functor F is naturally isomorphic to @pP’ . 
Similarly, G is naturally isomorphic to @pP. O 


The existence of a Morita context (R, R’ P, P’, t, u) with t and u surjective implies the 
equivalence of mod-R and mod-R’ and the equivalence of R-mod and R'-mod (Morita I (5)). A 
consequence of this result and Morita II is that R and R’ have equivalent categories of right 
modules mod-R and mod-R’ if and only if the categories R-mod and R’ -mod are equivalent. If 
this is the case, we say that R and R’ are (Morita) similar. Morita II and the characterization of 
generators in Theorem 3.21 (p. 173) permit a precise identification of the rings R’ which are 
similar to a given ring R, namely, we have 


THEOREM 3.23. For a given ring R let e be an idempotent in a matrix ring M,(R), n = 1, 
such that the ideal M,,(R)eM,,(R) generated by e is M,R). Then R' = eM,(R)e is similar to R. 
Moreover, any ring similar to R is isomorphic to a ring of this form. 

We shall require the following lemma, which collects a good deal of useful information. 


LEMMA. (1) If e is an idempotent in R and N is a right R-module, then homp(eR,N) is the 
set of maps ea ~ uea, u E N. The map ç~ fle) is a group isomorphism of homp(eR,N) onto Ne 
and this is a ring isomorphism of EndpeR = homp(eR, eR) onto eRe if N= eR. (2) The trace 


ideal T(eR) = ReR. (3) A right R-module M is a cyclic progenerator if and only if M = eR 
where e is an idempotent in R such that ReR = R. (4) If M = M,® M, for right R-modules 


and e, are the projections determined by this direct sum, then EndpM, = e({EndpM)e;,. 


Proof1) Since e is a generator of eR, a homomorphism f of eR into N is determined by the 
image fle) € N. Then fe) = fle’) = flee, so fle) = ue € Ne and flea) = uea. Conversely, for 
any u € N, the map ea ~ uea is a module homomorphism of eR into N. Direct verification 
shows that f ~ f(e) = ue is a group isomorphism of hom (eR, N) onto Ne and in the special case 
in which N = eR, this is a ring isomorphism (eRe is a subring of R and has e as unit). 

(2) The determination of hom,(eR, N) for the case N = R shows that (eR)* = hom p(eR,R) is 
the set of maps ea ~ bea, a,b e R. Hence 7(eR) is the set of sums )'b,ea; a, b; € R. Thus 
T(eR) = ReR. 

(3) It is clear that M is a cyclic projective right R-module if and only if M is isomorphic to 
a direct summand of R, hence, if and only if M = eR where e? =e e R. By Theorem 3.21, eR is 
a generator if and only if 7(eR) = R. Hence this holds if and only if ReR = R. Combining these 
results we see that M is a cyclic progenerator if and only if M = eR where e is an idempotent 
in R and ReR = R. 

(4) Let M = M, ® M, and let e, i = 1, 2, be the projections determined by this 
decomposition. Then e; € EndpM, e; + e, = l, e?; = e; and eje, = 0 = ee}. e\(EndpM)e, is a 
ring with the unit e; Ify € EndpM, eye; maps into itself and hence e ye,|M, € EndpM, Since 
(e;ne,)M, = 0, the ring homomorphism €e: > (¢v"¢:)!M1 is a monomorphism. Moreover, this 
map is surjective, since if C e EndpM, then e,Ce, € EndpM and e,Ce,|M, = ¢ Thus 
eine, (ene )IMi is an isomorphism of e,(End pM)e, onto EndpM,. Similarly, we have an 
isomorphism of e,(EndpM)e, onto EndpM). O 


We can now give the 


Proof of Theorem 3.23. We gave a direct proof of the equivalence of mod-R and mod-M,(R) 
in Proposition 1.4 (p. 29). A better approach to this is obtained by applying Morita I to the 
example of the left module M' = R of rows of n elements of R and the right module R™ of 
columns of n elements of R as in the example on p. 166. It is readily seen that the pairings u 
and t are surjective, so Morita I is applicable. Statement (5) of this theorem gives an 
equivalence of R and M,(R). Hence R and M,(R) are similar. If e is an idempotent in M,(R) 


such that M,(R)eM,(R) = M,(R) then, by (3) in the lemma, eM,(R) is a progenerator of mod- 
M,(R). Then, by Theorem 3.20, M,(R) and Endy (ry(eM,,(R)) are similar. By statement (1) of 


the lemma, the latter ring is isomorphic to eM,„(R)e. Thus M,(R) and eM,(R)e are similar and 


hence R and eM,(R)e are similar if M„(R)eM,(R) = M,(R). 

Conversely, let R’ be any ring similar to R. By Morita II, R’ = Endpp where P is a 
progenerator for mod-R. Then P is a direct summand of R” for some n and T(P) = R. Thus P = 
eR) where e? = e e EndpR”. Hence, by (4) of the lemma, End Pp = e(End R™)e. Also the 
application of Morita I to the example (R, M,,(R), RO) ™ R, t, u) shows that End RO” is the set 
of left multiplications of the elements of R” by n x n matrices with entries in R and End R™ = 
M,„(R). Identifying e with the corresponding matrix, we see that R' = eM,(R)e. Since P is a 
generator, we have T(P) = R and since P is a direct summand of R™, the elements of P* = hom 
(P, R) are the restrictions to P of the elements of hom (R™), R). The latter are the set of left 
multiplications by rows (aj, az, ... , d,), 4; € R. Since P = eR”, it follows that T(P) is the set 
of sums of elements of the form 


a;, b; € R. Hence 1 can be expressed as such a sum. It follows that M„(R)eM,(R) contains the 


matrix e,; whose (1-1)-entry is 1 and other entries are 0. Since M,(R)e,,M,,(R) = M,(R), we 
have M_,(R)eM,,(R) = M,,(R). Thus R' = eM,„(R)e where e? = e and M,,(R)eM,,(R) = M,(R). D 


Having settled the question of what the rings similar to a given ring look like, we now study 
more closely the equivalences of mod-R and mod-R' (and R-mod and R’-mod). By Morita II, 
these are given up to natural isomorphism by tensoring by an R’-R-bimodule P in a Morita 
context (R, R’! P P’ t, u) in which t and wu are surjective, hence, isomorphisms. Thus u is an R’- 
R'- bimodule isomorphism of P@pP’ onto R' and t is an R-R-bimodule isomorphism of P’®p,P’ 
onto R. Conversely, suppose we have such isomorphisms for an R’-R-bimodule P and an R-R'- 
bimodule P' for any pair of rings R and R’. Then, as in the proof of Morita I (5), ®@pP’ and @p-P 


are functors giving an equivalence of mod-R and mod-R’ and P®p and P’'®p give an 
equivalence of R’-mod and R-mod. Hence R and R’ are similar. Also since an equivalence of 
module categories sends progenerators into progenerators and R®;pP'=P’ it is clear that P’ is a 
progenerator for mod-R’. Similarly, it is a progenerator for R-mod and P is one for mod-R and 
for R'-mod. We shall now call an R'-R-bimodule P invertible if there exists an R-R'-bimodule 
P’ such that P'®pP = R as R-R-bimodule and P®p P’ = R' as R'-R'-bimodule.One can tensor 
multiply such modules. More precisely, if R” is a third ring and Q is an invertible R"-R’- 
bimodule (so R” and R' are similar), then the associative law for tensor products shows that Q 
®,p P is invertible. We can now relate this to isomorphism classes of functors giving 
equivalences between the categories of modules of two rings. This is the content of 


MORITA IIL. Let R,R',R",... be similar rings. Then the map P ~ Bp P defines a bijection of 


the class of isomorphism classes of invertible R'-R-bimodules and the class of natural 
isomorphism classes of functors giving equivalences of mod-R' and mod-R. In this 
correspondence, composition of equivalences corresponds to tensor products of invertible 
bimodules. 


Proof. The first statement amounts to this: If P is an invertible R'-R- bimodule, then ®@p P 


gives an equivalence of mod-R’ and mod-R and every such equivalence is naturally isomorphic 
to one of this form. Moreover, the functors ®p'P; for @p'P, for invertible R’-R-bimodules P, 


and P, are naturally isomorphic functors if and only if P, and P, are isomorphic as bimodules. 
The first assertion has been proved in the first two Morita theorems. Now suppose P, and P, 
are isomorphic invertible R’-R-bimodules. Then for any W’ = Mp, M'@pP, and N'@p'P> are 
isomorphic under an isomorphism that is natural in N’. Hence @p'P; and @p'P> are naturally 
isomorphic. Conversely, suppose @p P, and ®p'P> are naturally isomorphic. Then we have an 
R-isomorphism y of R'®g' P; onto R'®pg' P, such that for any a’ € R' 


f(a’) © Ip. 


R' Oy P, R' @» P, 
n n 
R' Op P, ha) © lr R' ©, P; 


is commutative. This means that R’ ®p' P4 and R' @p' P, are isomorphic as R'- R-bimodules. 
Then P; and P, are isomorphic as R'-R-bimodules. Now suppose Q is an invertible R"-R'- 
bimodule. Then we have the functor ®p-O from mod-R” to mod-R' and the functor ®@pP from 
mod-R' to mod-R. The composite is the functor (®}"QO)®pP from mod-R” to mod-R. By 
associativity, this is the functor ®&p{QƏp P). This is the meaning of the last statement of the 
theorem. O 


In the special case in which R’ = R, the preceding theorem relates isomorphism classes of 
invertible R-R-bimodules and natural isomorphism classes of auto-equivalences of mod-R. It 
is readily seen that the first of these classes is a set. (Prove this.) Hence it constitutes a group 
in which the multiplication is defined by the tensor product. The theorem gives an isomorphism 
of this group with the group of natural isomorphism classes of auto-equivalences of mod-R. 
The first group is denoted as Pic R. Of particular interest is the case in which R is 
commutative. We shall consider this group later (sections 7.8 and 10.6). 


EXERCISES 


1. Let e be an idempotent in a ring such that ReR = R. Show that the map B ~ eBe =B N 
eRe is a lattice isomorphism of the lattice of ideals of R onto the lattice of ideals of eRe. 


Does this hold without the assumption that ReR = R? 


2. Define a relation ~ among rings by R ~ R' if R' = eM,(R)e for some n and an idempotent 
e such that M_,(R)eM,(R) = M,(R). By Theorem 3.23 this is an equivalence relation. 
Can you prove this directly without using the Morita theory? 


3. Show that if R is a p.i.d., then any ring similar to R is isomorphic to a matrix ring M,(R). 
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Basic Structure Theory of Rings 


The structure theory of rings, which we shall consider in this chapter, is an outgrowth of the 
structure theory of finite-dimensional algebras over a field that was developed during the latter 
part of the nineteenth century and the early part of this century. The discovery of quaternions by 
Hamilton in 1843 led to the construction of other “hypercomplex number systems,” that is, 
finite dimensional algebras over R and €, including some non-associative ones (e.g., the 
octonions of Cayley and Graves). The problem of classifying such systems was studied 
intensively during the second half of the nineteenth century by a number of mathematicians: 
Molien, Frobenius, Cartan, and others. Their results constituted a very satisfactory theory for 
algebras over R and €, which included a complete classification of an important subclass, the 
semi-simple algebras. This structure theory was extended to finite dimensional algebras over 
an arbitrary field by Wedderburn in 1908. In 1927, Artin, stimulated by Emmy Noether’s 
earlier strikingly successful simplification of ideal theory of commutative rings by the 
introduction of chain conditions, and motivated by the needs of arithmetic in finite dimensional 
algebras, extended the Wedderburn theory to rings satisfying chain conditions for one-sided 
ideals. In 1945 this author developed a structure theory of rings without finiteness assumptions. 
The principal ingredients of this theory were a new definition of a radical, the concepts of 
primitivity and semi-primitivity (formerly called semisimplicity), and a density theorem (due 
independently to Chevalley) that constituted an extensive generalization of a classical theorem 
of Burnside. 

In this chapter we shall reverse the historical order of development by first giving an 
account of the general theory and then specializing successively to Artin’s theory and to 
Wedderburn’s theory of finite-dimensional algebras over fields. The Wedderburn structure 
theorem for simple algebras reduces the classification of these algebras to division algebras. 
The proper vehicle for studying these is the Brauer group of algebra classes with composition 
defined by tensor products. We shall give an introduction to the study of these groups and 
related results on central simple algebras. At the end of the chapter we shall apply the results 
to the study of Clifford algebras. These are needed to round out the structure theory of 


orthogonal groups that was presented in Chapter 6 of BAI. Further applications of the theory to 
representation theory of finite groups will be given in the next chapter. 


4.1 PRIMITIVITY AND SEMI-PRIMITIVITY 


In the development of ring theory that we shall give, rings of endomorphisms of abelian groups 
and concretizations of rings as rings of endomorphisms play a predominant role. We recall that 
if M is an abelian group (written additively), the set of endomorphisms of M has a natural 
structure of a ring. Thus we obtain End M (or Endy M), the ring of endomorphisms of M. By a 
ring of endomorphisms we mean a subring of End M for some abelian group M. We define a 
representation p of a ring R to be a homomorphism of R into a ring End M of endomorphisms 
of an additive abelian group M. A representation p of R acting on M (that is, with codomain 
End M) defines a left R-module structure on M by specifying that the action of R on M is given 


through p: 
(1) ax = p(a)x 


for a e R, x € M. Equation (1) defines a left module action since p(a) is an endomorphism and 
a ~ p(a) is a ring homomorphism. Conversely, if M is a left module for the ring R, then M is 
an abelian group and M affords a representation p = p y of R by defining ay for a € R to be the 


endomorphism 
Ay 2X ~> aX. 


Then p : a ~ aj, is a homomorphism of R into End M, hence a representation of R. 


By an irreducible representation of a ring we shall mean a representation p for which the 
associated module M (as just defined) is irreducible: M # 0 and M and 0 are the only 
submodules of M. The structure theory that we shall develop is based on two articles of faith: 
irreducible representations are the best kinds of representations and the best behaved rings are 
those that have enough irreducible representations to distinguish between elements of the ring, 
that is, given a pair of distinct elements a, b of R, there exists an irreducible representation p 
such that p(a) + p(b). Evidently this is equivalent to the simpler condition that given any a £ 0 
in R there exists an irreducible representation p of R such that p(a) £ 0. 

Before we proceed to the main definitions, it will be useful to collect some elementary 
facts on representations and modules. Because of the connection between representations and 
left modules that we have noted, we give preference in this chapter to left modules. 
Accordingly, in this chapter the unadorned term “module” will always mean left module. 

If p is a representation of the ring R acting in M, the kernel ofp is evidently the ideal 


(2) anny M = {be R|bM = 0}, 


(bx = p(b)x). We shall call the representation p (and occasionally the module M) faithful if ker 


p = 0. For any x e M we have the order ideal or annihilator of x, annpx = {b e R\bx = 0}. This 
is a left ideal and the cyclic submodule Rx generated by x is isomorphic to R/annpgx where R is 
regarded as left R-module in the natural way. It is evident that 


(3) ker p = annęM = () anngx. 


xeM 


If fis a homomorphism of a ring S into R and p is a representation of R acting on M, then pf 
is a representation of S acting on M. The corresponding module action of S on M is given by 


(4) bx = f (b)x 


for b e S. As in the last chapter, the notations ¿M and pM serve to distinguish M as S-module 


from M as R-module (as well as to indicate that the action is on the left). In most cases, it will 
be clear from the context which of these is intended and “M” will be an adequate notation. It is 
clear that 


(5) anny M =f '(ann,M) 


where, of course, f/( ) denotes the inverse image under f. If the representation p of R is 
faithful, then annçM = f !(0) = kerf 

Let B be an ideal of R contained in annpM. Then M becomes an R = R/B module by defining 
the action of the coset a + B on x € M by 


(6) (a+B)x = ax. 


Since bx = 0 for every b e B, x € M, it is clear that (6) is a well-defined action. It is 
immediate also that this is a module action of R on M and it is clear that a subset N of M is an 
R-submodule if and only if it is an R-submodule. In other words, pM and RM have the same 


lattice of submodules. The relation between the kernels of the representations of R and of R is 
given by the formula 


(7) anngM = (anng M)/B. 
We now proceed to introduce two of our main definitions. 


DEFINITION 4.1. A ring R is called primitive if it has a faithful irreducible 
representation. R is called semi-primitive if for any a + 0 in R there exists an irreducible 
representation p such that p(a) + 0 


We shall be interested also in representations that are completely reducible in the sense that 
the corresponding module M is completely reducible, that is, M = $} M, where the M, are 
irreducible submodules. We establish first two characterizations of semi-primitivity in the 
following 


PROPOSITION 4.1. The following conditions on a ring R are equivalent: (1) R is semi- 
primitive. (2) R has a faithful completely reducible representation. (3) R is a subdirect 
product of primitive rings (see p. 69). 


Proof. (1) = (2). For each a # 0 in R let M, be an irreducible module such that for the 
representation py we have p y(a) # 0. Form M = ©,,9M,. This is a completely reducible 
module and it is clear that 


(8) ann, M = (\ anngM, = 0. 
ax) 
Hence py is a faithful completely reducible representation for R. 
(2) = (3). Suppose p is a faithful completely reducible representation for R so M = $ Mp 
M, 
annpM, is an ideal in R. Hence R is a subdirect product of the rings R, = R/annpM,,. On the 


irreducible, for the corresponding module M. We have 0 = anngM = [)\,annpM,, and 


other hand, since (annrM,)M, = 0, M, can be regarded as an irreducible module for R and the 
representation of R, defined by M, is faithful. Hence R, is primitive and R is a subdirect 
product of primitive rings. 

(3) = (1). Let R be a subdirect product of the primitive rings R,. For each R, let p, be a 
faithful irreducible representation of R,. We have the canonical homomorphism z, of R onto R, 
and |, ker z, = 0. We have the irreducible representation p,,, of R whose kernel is ker 7,. 
Accordingly, we have a family of irreducible representations {p,7,,} of R such that! |, ker p,7, 
= (0. Then R is semi-primitive. 0 


The definitions we have given thus far involve objects (representations and modules) that 
are external to the ring R. It is easy and useful to replace the definitions by internal 
characterizations. Let 7 be a left ideal of the ring R. We define 


(9) (I:R)= {beR]bR c I}. 
It is clear that if we put M = R/I and regard this as a left R-module then 
(10) (I: R) = anngR/I. 


It follows from this or directly from (9) that (7 : R) is an ideal. Moreover, by(9), (I : R) c I and 
(I : R) contains every ideal of R contained in /. In other words, (J : R) is the (unique) largest 
ideal of R contained in /. We can now give the following characterization of primitivity and 
semi-primitivity. 


PROPOSITION 4.2. A ring R is primitive if and only if R contains a maximal left ideal I 
that contains no non-zero ideal of R. A ring R is semi-primitive if and only if R# 0 and ( \(I : 


R) = 0 where the intersection is taken over all maximal left ideals I of R. 


Proof. If R is primitive, we have an irreducible module M such that annpM = 0. Now M = R/I 
for a maximal left ideal Z (p. 117). Then (Z : R) = annpR/J = annpM = 0. Thus J is a maximal 
left ideal containing no non-zero ideal of R. Conversely, if the condition holds, we have a 
maximal left ideal J such that (J : R) = 0. Then M = R/I provides a faithful irreducible 
representation of R, so R is primitive. 

Now let R be semi-primitive and let {p,,} be a set of irreducible representations of R that is 
adequate for distinguishing the elements of R. Then | | ker Pa = 9. If M, is the module for p, 
then M, = R/I,, I, a maximal left ideal in R. Hence (Z, : R) = anngM,, = ker p, and! \,(Z, : R) = 
0. A fortiori, | (Z : R) taken over all of the maximal left ideals of R is 0. The converse follows 
by retracing the steps. 0 


The internal characterizations of primitivity and semi-primitivity have some important 
consequences that we shall now record. The first of these is 


COROLLARY 1. Any simple ring (+0) is primitive. 


Proof. Any ring R £ 0 contains a maximal left ideal J (p. 68). If R is simple, (7 : R) = 0. Hence 
R is primitive. O 


Next we obtain characterizations of primitive and semi-primitive commutative rings. 


COROLLARY 2. Let R be a commutative ring. Then R is primitive if and only if R is a 
field and R is semi-primitive if and only if it is a subdirect product of fields. 


Proof. If R is a commutative primitive ring, R contains a maximal left ideal Z containing no 
ideal 40 of R. Since R is commutative, / is an ideal. Hence Z = 0. Then 0 is a maximal ideal in 
R, which means that R + 0 and 0 and R are the only ideals in the commutative ring R. Then R is 
a field. Conversely, it is clear that any field satisfies the first condition of Proposition 4.2. 
Hence any field is primitive. It now follows from Proposition 4.1.3 that R is semi-primitive if 
and only if it is a subdirect product of fields. 0 


We shall now give some examples of primitive and semi-primitive rings. A convenient way 
of constructing examples of rings is as rings of endomorphisms of abelian groups. If R is a ring 
of endomorphisms of an abelian group M, then M is an R-module in the natural way in which 
the action ax, a e R, x € M is the image of x under a. Obviously, the corresponding 
representation is the injection of R into End M and so this is faithful. Hence if R acts 
irreducibly on M £ 0 in the sense that there is no subgroup of M other than 0 and M that is 
stabilized by R, then R is a primitive ring. Several of the following examples are of this type. 


EXAMPLES 


1. The ring L of linear transformations of a finite dimensional vector space over a division 
ring is simple (p. 171). Hence L is primitive. 


2. Let V be a vector space over a division ring A that need not be finite dimensional. We 
claim that the ring L of linear transformations in V over A acts irreducibly on V and hence is a 
primitive ring. To prove irreducibility, we have to show that if x is any non-zero vector and y 
is any vector, there exists an/ e L such that Zx = y. Now this is clear, since we can take x to be 
an element in a base (ea for V over A, and given any base (e,) and corresponding elements f, 
for every e, then there exists a linear transformation / such that le, = fẹ. If V is infinite 
dimensional, the subset F of transformations / with finite dimensional range (/(V) finite 
dimensional) is a non-zero ideal in L. Since 1 É F, F 4 L. Hence L is not simple, so this is an 
example of a primitive ring that is not simple. If the dimensionality of V is countably infinite, it 
is easy to see that F is the only proper non-zero ideal of L. Hence L/F is simple and thus 
primitive. 


3. If (X1,%7,x3,...) is a base for a vector space V over a division ring A, then a linear 
transformation / is determined by its action on the base. We can write 


(11) Ix; =) Aix; 
j 


where the 2, A and for a given i only a finite number of the 4; are #0. Thus / can be 
described by a matrix A = (4;), which is row finite in the sense that there are only a finite 


number of non-zero entries in each row. The set of row-finite matrices with entries in A is a 
ring under the usual matrix addition and multiplication, and the map Z + A where A is the matrix 
of / relative to the base (x;) is an anti-isomorphism. Now let R be the set of linear 


transformations whose matrices relative to the given base have the form 


® 0 
(12) A 


0 


where © is a finite square block and 4 € A. The set of matrices of the form (12) is a subring of 
the ring of row-finite matrices. Hence R is a subring of the ring of linear transformations L. R 
acts irreducibly on V. For, if x is a non-zero vector and y is any vector, then we can write x = 
Ex +... + 6x, Y = NX +... + 1x, for some n where the ¢, n; € A. Thus x and y are 
contained in a finite dimensional subspace V’ of V. There exists a linear transformation /’ of V' 
into itself such that l'x = y and /' can be extended to a linear transformation / of V contained in 
R (for example, by specifying that lx; = 0 if i > n). Then /x = y. Hence V is irreducible as R- 


module and so R is primitive. 


4. Let V be as in example 3 with A = F, a field, and let p and q be the linear transformations 
in V over F such that 


(13) PX, = 0, PX; = Xj;~-15 i> | 
(Xi = Xj4+15 t= Lote dsavan 


Then pqx; = x; for all i and qpx; = 0, gpx; =x; for i> 1. Hence 
(14) pq=1, qp#il. 


Let R be the set of linear transformations 
n 


(15) À adip, n = 0,1,2... 


i,j=0 


where the o,; € x. The first relation in (14) implies that R is closed under multiplication, so R 
is a ring of endomorphisms of V. Now put 


(16) e =g pgr, = i,j=1,2,.... 


The matrix of e;; has a 1 in the (j,i)-position, 0’s elsewhere. It follows that the matrix ring 


corresponding to R includes all the matrices (12). Thus R contains the ring given in example 3 
as a Subring. Hence the present R acts irreducibly on V, so R is primitive. 


5. For any ring R we have the regular representation pp whose module is the additive 
group of R on which the ring R acts by left multiplication (see BAI, pp. 422—426). This 
representation is faithful and the submodules of R are the left ideals. Now let A be an algebra 
over the field F. Then the regular representation p4 of A provides a monomorphism of A into 
the algebra L of linear transformations of the vector space A/F. Thus any algebra over a field 
can be imbedded in a primitive ring. We shall now show that A is also a homomorphic image 
of a primitive ring. Since A is an arbitrary algebra over a field, this will show that there is not 
much that can be said about homomorphic images of primitive rings. For our construction we 
take V = ®A, a direct sum of a countably infinite number of copies of the vector space A. Let A 
act on V in the obvious way and identify A with the corresponding algebra of linear 
transformations in V. For n = 1, 2, 3,... let V,, be the subspace of V of elements of the form 
(X1,%,---X,,0, 0,...), x; E A, and let LZ, be the set of linear transformations that map V, into 
itself and annihilate all of the elements (0,...,0,x,4 1X,49-:-). Then LZ, CL, œv. Put L = UL; 
and let R be the ring of linear transformations generated by A and L. Then R acts irreducibly on 
V, so R is primitive. Moreover, R = A+L, A N L =0, and L is an ideal in R. Then R/L = A, so 
the given algebra A is a homomorphic image of the primitive algebra R. 


6. 


The ring Z is semi-primitive since Plain) = 0 and 2/(p) is a field. Hence Z is a 


subdirect product of fields, so Z is semi-primitive. Similarly any p.i.d. with an infinite number 
of primes is semi-primitive. 


— 


EXERCISES 


. Let F{x,y} be the free algebra generated by x and y, and let K be the ideal in F'{x,y} 


generated by xy — 1. Show that F{x,y}/K is isomorphic to the algebra R defined in 
example 4 above. 


. (Samuel.) Let V,F be as in example 4 and let S be the algebra of linear transformations 


generated by p as in example 4 and the linear transformation r such that rx; = x7 + 1. 
Show that S is primitive and S is isomorphic to the free algebra F'{x,y}. 


. Let F {x,y} be as in exercise 1 and let K be the ideal generated by xy — yx — x. Assume 


that F is of characteristic 0. Show that R = F'{x,y}/K is primitive. 


. Let R be a ring containing an ideal N £ 0 that is nilpotent in the sense that there exists an 


integer m such that N” = 0 (that is, the product of any m elements of N is 0). Show that R 
is not semi-primitive. 


. Show that if R is primitive then the matrix ring M,(R) is primitive. 


. Show that if R is primitive and e is an idempotent element 40 in R, then the ring eRe 


(with unit e) is primitive. 


. Show that if R and R’ are Morita similar (p. 179), then R is primitive if and only if R’ is 
primitive. 
. Let F be a field, P = F[tj,..., ¢,], the ring of polynomials in n indeterminates with 


coefficients in F. Show that for any i = 1, 2,...,n, there exists a unique derivation D; of P 
such that 


Dt, = Oi) 


(see exercise 11, p. 147). For f e P let J denote the multiplication g ~ fg in P. Show 
that if D is a derivation in P and f € P, then /D is a derivation in P. Show that the 
derivations of P into P are the maps 2i/:2: where D; is as above and f; € P. Let R be the 
ring of endomorphisms of the additive group of P generated by the derivations and the 
multiplications of P. Show that if F is of characteristic 0, then R is an irreducible ring of 
endomorphisms and hence R is primitive. 


. Let F be a field of characteristic 0 and F{x,..., Xps ¥j.---. Yny} the free algebra over F 


determined by the 2n elements x),...,X,,, Vp---» Yn. Let J be the ideal in F'{x,..., X,5.--5 Vp 
..., Yny generated by the elements 


[i x], [yi yil, [Vi x;] i Oi) l, 
1 <ij <n. The algebra W, = F{xp..., Xp Vp... Yny is called a Weyl algebra. Show 
that if P and R are as in exercise 8, then the homomorphism y of F{x1,.. -Xps Yis- --Yn3 
into R such that x;+ t; y> Dp 1 <i <n, is surjective and ker 7 = Hence conclude that 
W,, is primitive. 


10. Show that W, is simple. (Suggestion: First treat the case in which n = 1.) 


4.2 THE RADICAL OF A RING 


From the point of view that we took in the previous section, the purpose of defining the radical 
of a ring is to isolate that part of the ring whose elements are mapped into 0 by every 
irreducible representation of the ring. Accordingly, we introduce the following 


DEFINITION 4.2. The (Jacobson) radical of a ring is the intersection of the kernels of the 
irreducible representations of the ring. 


Evidently, the radical, rad R, of the ring R is an ideal in R. 


We shall call an ideal P of R primitive (in R) if R/P is a primitive ring. Let / be a maximal 
left ideal of R and let P = (J: R). Then M = R/I is an irreducible module for R whose 
annihilator is P. Hence M can be regarded as an R = R/P-module by (6) and M is irreducible 
as R-module. By (7), anngM = anngM/P = P/P = 0. Hence M is a faithful irreducible module 


for R and so R is primitive. Then, by definition, P is a primitive ideal in R. Conversely, let P 
be a primitive ideal in R. Then we have an irreducible module M for R = R/P such that the 
associated representation is faithful. Regarding M as R-module via (4) we see that M is an 
irreducible R-module such that annpM = P (see(5). Now M = R/I where J is a maximal left 


ideal of R and hence annpR// = P. Since annpR/J = (I : R), we have P = (I: R). We therefore 
have the 


LEMMA 1. An ideal P of R is primitive in R if and only if P= (I: R) for some maximal left 
ideal I of R. 


We can now give our first internal characterization of the radical. 


PROPOSITION 4.3. (1) rad R is the intersection of the primitive ideals of R. (2) rad R is 
the intersection of the maximal left ideals of R. 


Proof. (1) By definition, 
(17) rad R = {\anngM 


where {M} is the class of irreducible modules of R. Since M = R/I and ann R/I = (I : R), we 
have 


(18) rad R = {\(I:R) 
where / runs over the maximal left ideals of R. By the lemma, this can be written also as 
(18’) rad R =()P 


where P runs over the primitive ideals in R. This proves (1). To prove (2), we recall that for 
any module M we have anngM =! em anngx =! |, +0 annex. If M is irreducible, this expresses 
annęM as an intersection of maximal left ideals. Hence, by (17), rad R is an intersection of 
maximal left ideals, so rad R > (V where Z ranges over the maximal left ideals of R. On the 
other hand, since Z > (/: R), { VD! \(: R) = rad R so 


(19) rad R = (\I 
where the intersection is taken over the set of maximal left ideals of R. O 


We prove next 


PROPOSITION 4.4. (1) R is semi-primitive if and only if rad R = 0. (2) If R #0, then R/rad 
R is semi-primitive and rad R is contained in every ideal B of R such that R/B is semi- 
primitive. 
Proof. (1) Ifrad R = 0, then! \P = 0 for the primitive ideals of R. Then R is a subdirect product 
of the primitive rings R/P and R is semi-primitive by Proposition 4.1. Conversely, if R is semi- 
primitive, R is a subdirect product of primitive rings and so there exists a set of primitive 
ideals whose intersection is 0. Then rad R =! \P = 0 if P runs over the set of primitive ideals of 
R. 

(2) If B is an ideal of R, then any ideal of R = R/B has the form P/B where P is an ideal of R 


containing B. Since R/P = R/ P where P = P/B, it is clear that P is primitive in R if 


and only if P is primitive in R. Since R is semiprimitive if and only if | P = 0, P 

primitive in R, it follows that R is semiprimitive if and only if B is the intersection of all of the 
primitive ideals of R containing B. Then B > rad R, by (18’). If B = radR, (18°) implies that B 
is an intersection of primitive ideals. Then R/B = R/rad R is semi-primitive. This proves (2). D 


We shall give next an important element characterization of the radical. For this purpose, 
we introduce a number of definitions. First, we call an element z of a ring R left (right) quasi- 
regular in R if 1 — z has a left (right) inverse in R. Evidently, left (right) quasi-regularity is 
equivalent to the following: the principal left (right) ideal R(1 — z) ((1 — z)R) = R. If z is 
both left and right quasi-regular, then z is called quasi-regular. A left (right) ideal J is called 


quasiregular if all of its elements are left (right) quasi-regular. 


Ifz is left quasi-regular, 1 — z has a left inverse that we can write as 1 — z’. Then (1 — z’) 
(i — z) = 1 gives the relation 


(20) Zoz=7z4+2'-77z=0, 


The binary product z’ O z = z + z' — z'z defines the circle composition O in the ring R. Let o 
denote the map x ~ 1 — x in R. Then o° = 1, so ø is bijective. We have 


1-—(1—z’')(1—-z) 
a((a(z')o(z)) 


a '(a(z')a(z)). 


This implies that the circle composition is associative. Also since o(0) = 1, 0oz = z = zo0. 
Thus (R,0,0) is a monoid. 
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LEMMA 2. (1) IfI is a quasi-regular left (right) ideal, then I is a subgroup of (R, o, 0). (2) 
Nilpotent elements are quasi-regular. 


Proof. (1) Letz e l. Then we have az’ e R such that z’ O z = 0. Then z + z’ — z'z = 0 and z’ =z 
'z — z € I. Hence there exists a z” such that z” O z' = 0. Since (R,O,0) is a monoid, the 
equations z’O z = 0 =z" Oz’ imply that z" = z, so z' is the inverse of z in (R, O, 0) and z' € J. 
Since / is closed under O and contains 0, statement (1) is clear. 

(2) fz” =0, then (1 — 2) (1 +z +2 +... +77 =1=(1+z+2+... +77) (11—732). 
Hence z is quasi-regular. D 


An ideal (left, right ideal) is called nil if all of its elements are nilpotent. The second part 
of Lemma 2 shows that any nil left (right) ideal is quasi-regular. 


We can now give the following element characterizations of rad R. 


PROPOSITION 4.5. (1) radR is a quasi-regular left ideal that contains every quasi-regular 
left ideal of R. (2) rad R is the set of elements z such that az is left quasi-regular for every a 
E R. 


Proof. (1) Suppose z € rad R is not left quasi-regular. Then R(1 —z) # R, so this left ideal can 
be imbedded in a maximal left ideal /. Since rad R is the intersection of the maximal left ideals 
of R,z e J. Butl—ze R(1 —z) C/,so 1 =1—z+z€e / contrary to / = R. This contradiction 
shows that every z € rad R is left quasi-regular, so it proves that rad R is a quasi-regular left 
ideal. Now let Z be any quasi-regular left ideal of R. If Z $ z rad R, then there exists a maximal 
left ideal Z such that Z $ I. Then Z + Z ? I and so Z + Z = R. Then 1 = b + z for some be J,ze 
Z, and 1 = (1 — z) !b e I contrary to / = R. Thus Z c R and part (1) is proved. 

(2) Ifz € rad R, then az € rad R for any a in R, so az is left quasi-regular. Conversely, 


suppose z satisfies this condition. Then Rz is a quasi-regular left ideal. Hence Rz c rad R by 
(1) andze rad R. O 


The concept of primitivity of a ring and of an ideal in a ring is not left-right symmetric. For, 
there exist primitive rings that are not right primitive in the sense that they have no irreducible 
right modules M’ such that the associated anti-representation p':a~+ p'(a), where p(a)x = xa, 
is monic. The first examples of such rings were given by George Bergman. In spite of this lack 
of symmetry in the concept of primitivity, the concepts of the radical and semi-primitivity are 
symmetric. Evidently, everything we have done can be carried over to antirepresentations, 
right modules, and right ideals. Suppose we denote the corresponding right radical as rad’ R. 
Then we have the analogue of Proposition 4.5 for rad’ R. On the other hand, by Lemma 2, the 
elements of rad R and rad’ R are quasi-invertible. Hence, by Proposition 4.5 and its analogue 
for right ideals, we have rad’ R c rad R and rad R c rad’ R. Thus rad R = rad’ R. This permits 
us to give a number of additional characterizations of rad R. We give these and summarize the 
ones we obtained before in 


THEOREM 4.1. We have the following characterizations of the radical of a ring: radR is 


. the intersection of the primitive ideals of R, 

. the intersection of the maximal left ideals of R, 

. the intersection of the right primitive ideals of R, 

. the intersection of the maximal right ideals of R, 

. a quasi-regular left ideal containing every quasi-regular left ideal, 

. the set of elements z such that az is left quasi-regular for every aé R, 

. a quasi-regular right ideal containing every quasi-regular right ideal, 
. the set of elements z such that za is right quasi-regular for everyaé R. 


CONN NB WN 


It is clear also that rad R contains every nil right ideal as well as every nil left ideal of R. 
Moreover, since right semi-primitivity (defined in the obvious way) is equivalent to rad’R = 0, 
it is clear that a ring is semi-primitive if and only if it has a faithful completely reducible anti- 
representation and if and only if it is a subdirect product of right primitive rings. Another way 


of putting the left-right symmetry is that if we regard R and R” as the same sets, then rad R = 
rad R” and if R is semi-primitive, then so is RP. 


EXERCISES 

1. Show that a ring R is a local ring (p. 111) if and only if R/rad R is a division ring and that 
if this is the case, then rad R is the ideal of non-units of R. 

2. Determine the radical of 2/(n), n => 0. 


3. (McCrimmon.) Show that z € rad R if and only if for every a € R there exists a w such 


that z + w = waz = zaw. 
4. Show that ab is quasi-regular if and only if ba is quasi-regular. 


5. Call an element a of a ring R von Neumann regular if there exists a b such that aba = a. 
Show that the only element in rad R having this property is 0. (A ring is called von 
Neumann regular if all of its elements are von Neumann regular. Hence this exercise 
implies that such rings are semi-primitive.) 


6. Let R be a ring such that for any a € R there is an integer n(a) > 1 such that a” = a. 
Show that R is semi-primitive. 


7. Let e be a non-zero idempotent in a ring R. Show that rad eRe = e(rad R)e = eRe N rad R. 
(Sketch of proof: Show that every element of e(rad R)e is quasiregular in eRe, so e(rad 
R)e c rad eRe. Next letz € rad eRe and leta € R. Write a = eae + ea(1 — e) + (1 — e)ae 
+ (1 — e)a(1 — e) (the two-sided Peirce decomposition). Then za = zeae + zea( 1 —e) 
and zeae has a quasi-inverse z’ in eRe. Then za O z' = zea(1 — e), which is nilpotent, thus 
quasi-regular. Hence za is right quasiregular. Hence z e rad R N eRe and rad eRe c 
e(rad Re.) 

8. Show that for any ring R, rad M,(R) = M,(rad R). (Sketch of proof: We have a 1-1 
correspondence B ~ M,(B) of the set of ideals of R onto the set of ideals of M,(R) (see 
exercise 1, p. 171). Let {eJl <i, j < n} be a set of matrix units for M„(R). We have the 
isomorphism r ~ (rl)e;; of R onto e;; M,(R)e,;. Under this the image of rad R is 
rade,,M,(R)e1, = ey, radi, (R)e,, (by exercise 7) = e,;M,,(B)ej. It follows that B = rad 
R.) 

9. Show that if R and R’ are (Morita) similar, then in any correspondence between the 
ideals of R and R' given by Morita I, the radicals of R and R’ are paired. 


10. Prove that any ultraproduct of semi-primitive (primitive) rings is semi-primitive 
(primitive). 


14.3 DENSITY THEOREMS 


If V is a vector space over a division ring A, then a set S of linear transformations in V is 
called dense if for any given finite set of linearly independent vectors x,,x>, ..., x, and 


corresponding vectors y4, Y2, ..., Yp there exists an/ € S such that ix; =y; 1 <i<n. If Vis finite 
dimensional, then we can take the x; to be a base and y; = ax; for any given linear 
transformation a. Then we have an/ € S such that /x; = ax; 1 <i <n, from which it follows that 


l= a. Hence the only dense set of linear transformations in a finite dimensional vector space is 
the complete set of linear transformations. 
In this section we shall prove a basic theorem that permits the identification of any 


primitive ring with a dense ring of linear transformations. We consider first a more general 
situation in which we have an arbitrary ring R and a completely reducible (left) module M for 
R. Let R' = EndpM, R" = Endp.M where M is regarded in the natural way as a left R’-module. 


We shall prove first the 


DENSITY THEOREM FOR COMPLETELY REDUCIBLE MODULES. Let M be a 
completely reducible module for a ring R. Let R'= Endp M, R" = Endp M. Let {x,,..., Xp} be 


a finite subset of M, a" an element of R". Then there exists ana E€ R such that ax;=a"x;, 1 < 
Lan, 


The proof we shall give is due to Bourbaki and is based on the following pair of lemmas. 
LEMMA 1. Any R-submodule N of M is an R"-submodule. 


Proof. Since M is completely reducible as R-module, we can write M = N ® P where P is a 
second submodule (p. 121). Let e be the projection on N determined by this decomposition. 
Then e e R'and N= e(M). Hence if a” € R", then a"(N) = a"e(M) = ea"(M) c N. Thus N is a 
submodule of pr M. O 


LEMMA 2. Let M be a module, M” the direct sum of n copies of M, n = 1, 2,... Then 


EndpM) is the set of maps (uy Uy, ...5 Up) = (V1 Vo, ---, Vp) where v; = F, d'jtp ay € R'= 


EndpM. Moreover, for any a" € R" = EndpM, the map (u, uz, ..., U,)+(a"Uz, AU, ..., a"U,) 
is contained in the ring of endomorphisms of M” regarded as a left EndpM"-module. 


Proof. Let 1 € EndpM, and consider the action of J on the element (0, ..., 0, u; 0, ..., 0) 
where u; is in the ith place. We have /(0,..., 0, u,0, ..., 0) = (uy; Uo; -.-, Uni), Which defines the 
hi we obtain /(uy, 
-ao Up) = (LAM LAM Lanu), Tt is clear also that any map of this form is contained in 


map u; ~œ uj; 1 <j <n. This is contained in EndgM. Denoting this map as a 


End,M), This proves the first assertion. The second follows, since for any a” € R" the map 
(Uj, Uz, ..., Up) ~ (a"uj, a" Uy, ..., a'u) commutes with every map (uj, Uz, ..., Up) ~ 
(Ea itn Laun. Dapit), a c R.O 


We can now give the 


Proof of the theorem. Suppose first that n = 1. Then N = Rx, is an R -submodule, hence an R” - 
submodule. Since x, € N, a"xı € N= Rx. Hence we have ana e R such that ax, = a"x,;. Now 
suppose n is arbitrary. Consider M®” the direct sum of n copies of the R-module M. The 
complete reducibility of M implies that M” is completely reducible. By Lemma 2, if a” € R", 
the map (u, U>, ..., U,) (a"u;, A", ..., a"u,) is in the ring of endomorphisms of M”, regarded 
as left End,M”) module. We apply the result established for n = 1 to this endomorphism and 


the element x = (x), ..., X„) of M. This gives an element a € R such that ax = (a"x, ..., a"x,,). 
Then ax; = a"x;, 1 <i <n, as required. O 


There is a good deal more that can be done at the level of completely reducible modules 
(see the exercises below). However, for the sake of simplicity, we shall now specialize to the 
case that is of primary interest for the structure theory: a faithful irreducible representation of a 
ring. In this case we have the 


DENSITY THEOREM FOR PRIMITIVE RINGS. 4 ring R is primitive if and only if it is 
isomorphic to a dense ring of linear transformations in a vector space over a division ring. 


Proof. Suppose R is a primitive ring, so R has a faithful irreducible representation p acting in 
M. By Schur’s lemma, A = Endp™ is a division ring. Since p is faithful, R is isomorphic to 
p(R), which is a ring of linear transformations in M regarded as a vector space over A. Now 
let x), ..., x, be linearly independent vectors in M, y;, ..., Y, arbitrary vectors. Since the x’s 
can be taken to be part of a base, there exists a linear transformation / such that x;=y,, 1 < i < 
n. Since l e End,M and 4 = EndpM, it follows from the previous theorem that there exists an a 
€ R such that ax; = /x;=y,, 1 <i < n. Then p(R) is a dense ring of linear transformations in M 


over A. Conversely, suppose R is isomorphic to a dense ring of linear transformations in a 
vector space M over a division ring 4. If p is the given isomorphism, then putting ax = p(a)x 
for a e R,x € M, makes M an R-module. Moreover, this is irreducible since if x # 0 and y is 
arbitrary, we have ana € R such that ax = y. Hence p is a faithful irreducible representation of 
R and so R is primitive. O 


For some applications it is important to have an internal description of the division ring A 
in the foregoing theorem. This can be obtained by identifying an irreducible R-module with an 
isomorphic one of the form R/I where J is a maximal left ideal of R. We need to determine 
Endg(R/I). We shall do this for any left ideal Z or, equivalently, for any cyclic module. We 


define the idealizer of I in R by 
(21) B = {be R|Ib c I}. 


It is clear that this is a subring of R containing / as an ideal and, in fact, B contains every 
subring of R having this property. We have 


PROPOSITION 4.6. IfI is a left ideal in R, then Endp(R/J) is anti-isomorphic to B/I where 
B is the idealizer of I in R. 


Proof. Since 1 + J is a generator of R/I, fe Endp(R/J) is determined by A1 +D =b+/=5(1 
+ 1) as the map 


(22 a+] =a(1+I)~ a(b4+I) = ab. 


Ifae l,a+I=]I= 0 in R// and hence its image ab + I= I. Then ab e l. Thus b e the idealizer 
B of I. Conversely, let b € B and consider the homomorphism of the free R-module R into R// 
sending 1 ~ b + /. Any d € lis mapped into db + / = I by this homomorphism. Hence we have 
an induced homomorphism of R// into R/J sending 1 + 7 ~ b + I. Thus any b € B determines an 
endomorphism of R/I given by (22). Denote the correspondence between elements of B and 
endomorphisms of R// determined in this way by g. Evidently g is additive and g(1) = 1. Also 
if b}, ba E B, we have g(b,b,)(1 + 1) = bib, + I and g(b,)g(b,) (1 + D = g(b2)(6, + D) = by by + 
I. Hence g is an antihomomorphism of B onto Endp(R//). The ideal in B of elements mapped 
into 0 by this map is /. Hence B/I and End,(R//) are anti-isomorphic. O 


The reader should be warned that in spite of this nice determination of the division ring 4 in 
the density theorem, given by Proposition 4.6, this division ring is not an invariant of the 
primitive ring. A given primitive ring may have non-isomorphic irreducible modules giving 
faithful representations and even some irreducible modules whose endomorphism division 
rings are not isomorphic. There are important cases in which the division ring is an invariant 
of the primitive ring. For example, as we shall show in the next section, this is the case for 
artinian primitive rings. Our next theorem gives the structure of these rings. 


THEOREM 4.2. The following conditions on a ring R are equivalent: 
(1) R is simple, left artinian, and non-zero. 
(2) R is primitive and left artinian. 
(3) R is isomorphic to a ring End,M where M is a finite dimensional vector space over 
a division ring A 


Proof. (1) + (2) is clear. 
(2) = (3). Let M be a faithful irreducible module for R, A = EndpM. We claim that M is 
finite dimensional over 4. Otherwise, we have an infinite linearly independent set of elements 


pi = 1, 2, 3,...} in M. Let J; = anngx; so J; is a left ideal. Evidently, 71 ^ OTa is the subset of 
R of elements annihilating x),..., x,,. By the density theorem there exists an a € R such that ax, 
=" = AX, = 0, ax, +1 Ż 0. Hence 0h P h AA la: Then 


h??? 


is an infinite properly descending sequence of left ideals in R contrary to the artinian property 
of R. Thus M is finite dimensional over 4 and if we let (x), X2, ..., X„) be a base for M over 4, 


the density property implies that for any y4, ..., y, there exists ana € R such that ax; =y, 1 <i 
<n. Hence ifp is the representation determined by M, p(R) = End,™ so R is isomorphic to the 
ring of linear transformations in the finite dimensional vector space M over 4. 


(3) = (1). This has been proved in proving the Wedderburn-Artin theorem (p. 171). It can 
also be seen directly by using the anti-isomorphism of End,M with M,(4) and recalling that if 


R is any ring, the map B ~ M_,(B) is an isomorphism of the lattice of ideals of R with the lattice 


of ideals of M,,(R) (exercise 8, p. 103 of BAI; or exercise 1, p. 171). The result then follows 
from the simplicity of 4. 0 


EXERCISES 


. Let R be a primitive ring, M a faithful irreducible module for R, A = EndpM. Show that 


either R = M,(A) for some n or for any k = 1, 2, 3,..., R contains a subring having M,(4) 
as a homomorphic image. 


. Use exercise 1 to show that if R is a primitive ring such that for any a € R there exists an 


integer n(a) > 1 such that a” = a, then R is a division ring. 


. Call a set S of linear transformations k-fold transitive if for any 1 < k linearly 


independent vectors x,,..., x, and arbitrary vectors y,,..., yı there exists ana € S such 
that ax; = y; 1 < i < lL. Show that a ring of linear transformations that is two-fold 
transitive is dense. 


In the next three exercises, M = pM is completely reducible, R’ = EndpM, R" = Endp M. 


4. 


Show that any homomorphism of a submodule pN of pM into pM can be extended to an 
endomorphism of pM (an element of R’) and that if pN is irreducible, any non-zero 
homomorphism of pN into pM can be extended to an automorphism of pM. 


. Show that if x # 0 is an element of an irreducible submodule pN of pM, then R is an 


irreducible submodule of p: M. (Hint: Let y = a'x #0, a’ € R', and consider the map u ~ 
a'u of pN = Rx into pM.) 


. Show that p’ M is completely reducible and that its dimensionality (defined on p. 125) is 


finite if R is left artinian. Show also that in this case R” = p(R) where p is the 
representation defined by M. 


. Let M be a completely reducible R-module, B an ideal in R. Show that the following 


conditions are equivalent: (1) BM = M. (ii) Ifx € M satisfies Bx = 0, then x = 0. (111) For 
any x, x E€ Bx. 


. Prove the following extension of the density theorem for completely reducible modules: 


If R, M, and B are as in exercise 7 and B satisfies (1), (11), (111), then for any a” € R" and 
X1,...,X, E M there exists a b € B such that bx;=a"x, 1 <i<n. 


. Use exercise 8 to prove the simplicity of the ring of linear transformations in a finite 


dimensional vector space over a division ring. 


1.4 ARTINIAN RINGS 


In this section we shall derive the classical results on structure and representations of (left) 
artinian rings. We shall show first that the radical of such a ring is nilpotent. We recall that if B 
and C are ideals of a ring R, then BC = |Xicilb;€ B,e;€C}, This is an ideal in R. Also, if D is 
another ideal, then (BC)D = B(CD). We define B* inductively by B! = B, B'= B'~! B and B is 
called nilpotent if B% = 0 for some k. This is equivalent to the following: the product of any k 
elements of B is 0. In particular, b% = 0 for every B. Hence B nilpotent implies that B is a nil 
ideal. On the other hand, it is easy to give examples of nil ideals that are not nilpotent (exercise 
1, at the end of this section). 


THEOREM 4.3. The radical of a (left) artinian ring is nilpotent. 


Proof. Let N=rad R, R artinian. By induction, Nd M > M D .... Since these are left ideals, 
there is a k such that P= M = M * ! = .... Then P = P? = N*. Nis nilpotent if and only if P = 
0. Suppose this is not the case and consider the set S of left ideals J of R such that (1) Z c P and 
(2) PI # 0. Since P has these properties, S is not vacuous, and since R is artinian, S contains a 
minimal element /. Evidently, Z contains an element b such that Pb # 0. Then Pb is a left ideal 
contained in J and in P and P(Pb) = P?b = Pb + 0. Hence Pb = J by the minimality of J in S. 
Since b € I, we have az e P such that zb = b or (1 — z)b = 0. Since z e P c rad R, zis quasi- 
regular. Hence (1 — z)! exists and b = (1 — z) ‘(1 — z)b = 0. This contradicts Pb + 0 and 
proves (rad R) = P=0.0 


We recall that the radical of any ring contains all quasi-regular one-sided ideals, hence all 
nil one-sided ideals of the ring. The foregoing result implies that all such ideals in an artinian 
ring are nilpotent. We shall now derive the main result on the structure of semi-primitive 
artinian rings. For the proof we shall need the 


LEMMA. Let M be an artinian module that is a subdirect product of irreducible modules. 
Then M is a direct sum of a finite number of irreducible submodules. 


Proof. The second hypothesis is equivalent to the following: M contains a set of submodules 
{N,} such that NN, = 0 and every M, = MIN, is irreducible. Consider the set of finite 


intersections Na O° ONa of Nx€1N.}. The artinian condition on M implies that there exists a 
minimal element in this set, say, N; N ... N Np Then for any Nas Na O (N1 9... AN) = NO 


.. IN, s0 N, D N1 NN... N Np Since (ÌN, = 0, we have N, N ... N N,, = 0. Then we have a 


monomorphism of M into TIM, = ®M,, M, = M/N;.. Thus M is isomorphic to a submodule of the 
completely reducible module ®M;. Then M is completely reducible and is a direct sum of 


irreducible modules isomorphic to some of the M,. 0 


We shall now call a ring semi-simple if it is a subdirect product of simple rings. We have 
the following easy consequence of the preceding lemma. 


PROPOSITION 4.7. Jf R is semi-simple and R satisfies the minimum condition for two- 
sided ideals, then R is a direct sum ©" R; of simple rings R,. 


Proof. Let M(R) be the ring of endomorphisms of the additive group of R generated by the left 
and the right multiplications x ~ ax and x ~ xa. This ring is called the multiplication ring of 
R. The additive group R can be regarded in the natural way as M(R)-module and when this is 
done, the submodules of ygyR are the ideals of the ring R. The hypothesis that R is semi- 
simple is equivalent to the following: R as M(R)-module is a subdirect product of irreducible 
M(R)-modules. The other hypothesis of the lemma is also fulfilled. The conclusion then 
follows from the lemma. O 


We can now prove the main 


STRUCTURE THEOREM FOR SEMI-PRIMITIVE ARTINIAN RINGS. The following 
conditions on a ring R are equivalent: 


(1) R is artinian and contains no nilpotent ideals + 0. 
(2) R is semi-primitive and artinian. 

(3) R is semi-simple and artinian. 

(4) pR is a completely reducible R-module. 


(5) R is a direct sum of a finite number of rings R; each of which is isomorphic to the 


ring of linear transformations of a finite dimensional vector space over a division 
ring. 


The implication (1) = (5) is called the Wedderburn-Artin theorem. 


Proof. (1) = (2). This is clear, since the radical of an artinian ring is a nilpotent ideal 
containing all nilpotent ideals and rad R = 0 is equivalent to semi-primitivity. 

(2) = (3) is clear from Theorem 4.2, which establishes the equivalence of simplicity and 
primitivity for artinian rings. 

(2) => (4). Since rad R is the intersection of the maximal left ideals, semi-primitivity of R 
implies that pR is a subdirect product of irreducible modules. Then (4) follows from the 
lemma. 

(4) = (2). The semi-primitivity is clear, since the representation determined by pR is 
faithful and completely reducible. Also we have R = >'Ja, Ja a minimal left ideal. Then 
Pept +e, cele (l,}. Then for aeR, we have a = al = 
= ae + +aeEel ++ Thus R = 1, ++ and hence R = I, 1, ® ... © I; for a subset of the J). 


Then it is clear that 
R>1,4+° +1, 21,4: +1, 3° 31,50 


is a composition series for pR. The existence of such a series implies that R is artinian and 
noetherian. Then (2) holds. 

(3) = (5). Evidently the artinian property implies that R satisfies the minimum condition for 
two-sided ideals. Hence, by Proposition 4.7, R is a direct sum of simple rings. Thus 
R = RıÐR2® ƏR, where the R; are ideals and are simple rings. Then for i +j, RiR; c R; N R; = 
0. It follows that any left ideal of R; is a left ideal of R; hence, each R; is artinian. Then R; is 
isomorphic to the ring of linear transformations of a finite dimensional vector space over a 
division ring. Hence (5) holds. 

(5) =(2). If R = Ri DRG ƏR, where the R; are ideals and R; is isomorphic to End, M;, M; 
a finite dimensional vector space over the division ring 4), then R; is primitive artinian by 
Theorem 4.2, and R is a subdirect product of primitive rings, so R is semi-primitive. Since R; 
is a submodule of pR and the R-submodules of R; are left ideals, pR; is artinian. Hence R is 
artinian. Thus (2) holds. This completes the proof. O 


Let R be any ring and suppose R = R, ©... @Ry where the R; are ideals that are 


indecomposable in the sense that if R: = Ri@R7. Ri, RY ideals, then either R’. = 0 or R” = 0. Now 
let B be any ideal in R. Then B = B1 = BR = BR, +...+ BRs and since BR; c R; we have B = 


BR, ®...® BRs. Also BR; is an ideal of R, so if B is indecomposable then we have BR, = B for 
some i. Similarly, we have B = R,B for some j and since R; N R, = 0 ifi #7, we have B = RB. 


It follows that if R=R1® ƏR; is a second decomposition of R as a direct sum of 
indecomposible ideals, then s = ¢ and the B; are, apart from order, the same as the R;. This 
applies in particular to the decomposition of a semi-simple artinian ring as direct sum of ideals 
that are simple rings. We see that there is only one such decomposition. Accordingly, we call 
the terms R; of the decomposition the simple components of R. 


The structure theorem for semi-primitive artinian rings (or Theorem 4.2) shows that if R is 
simple artinian, then R = End, M, M a finite dimensional vector space over the division ring A. 


We now consider the question of uniqueness of M and A. The problem can be posed in the 
following way: Suppose End,,M, and End,.M, are isomorphic for two finite dimensional 


vector spaces M; over 4;; what does this imply about the relations between the two spaces? We 


shall see that the vector spaces are semi-linearly isomorphic. We recall that if o is an 
isomorphism of 4, onto 45, then a map s : Mı — M, is called a a-semi-linear map if 


(23) s(x+y)=sx+sy, s(x) = a(0) (sx) 


for x,y € M; 5 € A, (BAL p. 469). It is immediately apparent that if s is bijective, then s~! is a 


o | -semi-linear map of M, onto M}. We shall say that M, and M, are semi-linearly 


isomorphic if there exists a semi-linear isomorphism ( = bijective semi-linear map) of M, onto 
M,. This assumes tacitly that the underlying division rings 4, and 4, are isomorphic. 
Moreover, if {x,} is a base for M; over 4, and s is a semi-linear isomorphism of M, onto M3, 
then {Sxa} is a base for M, over A,. Hence the spaces have the same dimensionality. 
Conversely, let M} over A, and M, over 4, have the same dimensionality and assume that 4, 
and 4, are isomorphic. Let {xa}, {Ya} be bases for M, and M, respectively, indexed by the 
same set, and let o be an isomorphism of 4; onto A,. Define the map s of M; into M, by 


(24) ¥6,x, > Xa(6,)y, (finite sums). 


One checks directly that s is o-semi-linear and bijective. Hence we see that M} over A, and 
M, over A, are semi-linearly isomorphic if and only if they have the same dimensionality and 
A, and A, are isomorphic. 


The key result for the isomorphism theorem (and for representation theory as well) is the 
following 


LEMMA 1. Any two irreducible modules for a simple artinian ring are isomorphic. 


Proof. We choose a minimal left ideal 7 of R, so Z is an irreducible R-module. The result will 
follow by showing that any irreducible module M for R is isomorphic to 7. Since R is simple, 
the representation of R given by M is faithful. Hence, since / # 0, there exists an x e M such 
that Ix # 0. We have the homomorphism b ~ bx of J into M. Since 7x # 0, the homomorphism is 
not 0. By Schur’s lemma, it is an isomorphism O 


We shall require also the following 


LEMMA 2. Let M be a vector space over a division ring A, R = End,M the ring of linear 


transformations of M over A. Then the centralizer of R in the ring of endomorphisms End M 
of M as abelian group is the set of maps 0' :u ~ ôu, ô € A. Moreover, 6 ~ 0' is an 
isomorphism of A onto this centralizer. 


Proof. Since M is a completely reducible A-module, the first statement is a special case of the 
density theorem. We can also give a simple direct proof of this result: Let d be an 
endomorphism of (M, + ,0) that centralizes End,M. Let x # 0 in M. Then dx € Ax. Otherwise, 


x and dx are linearly independent, and hence there is an/ e End,M such that /x = 0 and /(dx) + 
0. Since (dx) = d(x) = 0, this is impossible. Hence for any x # 0 we have a ô, € 4 such that 
dx = 0,x. Let y be a second non-zero element of M and choose / e End,M such that /x = y. 
Then ôy = dy = dix = Idx = ld,x = ô lx = 6,y. Hence ð = 0, for all non-zero x and y, which 
implies that d has the form 6’: u ~ ou. It is clear that ô + 6'is an isomorphism. 0 


We can now prove the 


ISOMORPHISM THEOREM FOR SIMPLE ARTINIAN RINGS. Let Mbe a finite 
dimensional vector space over a division ring A; i = 1, 2, and let g be an isomorphism of R} 
= End, }M, onto R, = End, .M,. Then there exists a semi-linear isomorphism s of M, onto M, 
such that 


(25) g(a) = sas’ 


forallae R}. 


Proof. We consider M, as R, = End,,M, module in the natural way and regard M, as R4- 
module by defining ay = g(a)y for a € R, y €M,. These two modules for the simple artinian 
ring R, are irreducible. Hence they are isomorphic. Accordingly, we have a group 
isomorphism s of M} onto M, such that for any x e€ M}, a € R, we have s(ax) = a(sx). Since ay 
= g(a)y, y € Mh, this gives the operator relation g(a) = sas! which is (25). 

Since § is an isomorphism of M, onto M), the map b ~ sbs™ !, , e End M, is an 
isomorphism of the endomorphism ring End M, of the abelian group M; onto the 
endomorphism ring End M, of the abelian group M3. Since this maps End, M, onto End,.M,, 
its restriction to the centralizer of End ,, M4 in End M, is an isomorphism of this centralizer 
onto the centralizer of End,»/, in End M,. By Lemma 2, the two centralizers are isomorphic 
respectively to A, and A, under the maps sending ô; € A; into the endomorphism ô' : u; + 6,u;. 
Hence we have an isomorphism ø of 4, onto A, that is the composite of ô: ~ 41 ~ sô1s7" with the 
inverse of ô ~ 6’. Thus we have so’,s ! = (o6,)' and hence for any x e M, we have 


8(5,x) = 80,x = sõis (sx) = (aò, )' (sx) = (9, )sx , This shows that s is a o-semi-linear isomorphism 
of M, onto M,. 0 


An immediate consequence of this theorem is that if End, M, and End} M, are isomorphic, 
then M, and M, are semi-linearly isomorphic. Conversely, if s is a semi-linear isomorphism of 
M; onto M, then sas! € End, A, Mo for any a € End, M; and it is immediate that a ~ sas" lis 
an isomorphism of End, M, onto End;,M>. We have seen that M} and M, are semi-linearly 
isomorphic if and only if M} and M, have the same dimensionality and 4, and 4, are 


isomorphic. This shows that a simple artinian ring is determined up to isomorphism by an 
integer n, the dimensionality of M, and the isomorphism class of a division ring A. 


The isomorphism theorem for simple artinian rings also gives a determination of the 
automorphism group of such a ring. If we take R = End,M where M is a finite dimensional 


vector space over the division ring 4, then the isomorphism theorem applied to End,M and 
End, states that the automorphisms of R are the maps 


(26) a~ sas”! 


where s e G, the group of bijective semi-linear transformations of the vector space M. 
Denoting the map in (26) as J,, we have the epimorphism s ~ J, of G onto Aut R, the group of 


automorphisms of R. The kernel of this epimorphism is the set of bijective semi-linear 
transformations s such that sas"! = a for every a € R=End,M. By Lemma 2, any such map has 
the form 6’: x + dx, ô € A. We note also that if ô + 0, then 6’ is a semi-linear automorphism of 
M whose associated automorphism of 4 is the inner automorphism i;: a ~ 6a6 !.This follows 
from 


O'(ax) = d(ax) = (ðxð NOx = (xð LV 5'x, 


Thus the kernel of s ~ J, is the group A” of multiplications x ~ dx determined by the non-zero 
elements 6 of 4. Evidently, our results on automorphisms can be stated in the following way. 


COROLLARY. We have an exact sequence 


(27) 1+ A*5G5AutR—>1 


where A* is the multiplicative group of non-zero elements of A, G is the multiplicative group 
of bijective semi-linear transformations of the vector space M, Aut R is the group of 
automorphisms of R= End M, i is the map 6 ~ ô', and lis the map s ~ J,. 

Of course, this implies that Aut R = G/A*". 

A reader who is familiar with the fundamental theorem of projective geometry (see BAI, 
pp. 468-473) should compare the results obtained here on isomorphisms and automorphisms 
of simple artinian rings with the fundamental theorem and its consequences. We remark that the 
present results could be derived from the geometric theorem. However, it would be somewhat 
roundabout to do so. 

We shall derive next the main result on representations of artinian rings. This is the 
following 


THEOREM 4.4. Any module for a semi-simple artinian ring R is completely reducible and 
there is a 1—1 correspondence between the isomorphism classes of irreducible modules for 
R and the simple components of the ring. More precisely, if R = 8:®°"®®s where the R; are 


the simple components and I, is a minimal left ideal in R,, then {l;,... ls} is a set of 
representatives of the isomorphism classes of irreducible R-modules. 


Proof. Write each simple component as a sum of minimal left ideals of R. This gives R = X4; 
where /; is a minimal left ideal of R contained in a simple component. Now let M be an R- 


module and let x be any non-zero element of M. Then x = 1x e Rx = )J,x. By Schur’s lemma, 
either J,x = 0 or Ix is an irreducible module isomorphic to Z}. Now M = Esx Rx = Leste® and 


this shows that M is a sum of irreducible submodules that are isomorphic to minimal left ideals 
of the simple components of R. Hence M is completely reducible and if M is irreducible, it is 
isomorphic to a minimal left ideal / of a simple component of R. To complete the proof, we 
need to show that if 7 is a minimal left ideal in Rj and J’ is a minimal left ideal in R,, then 7 = 7' 


as R-modules if and only if j = j’. Suppose first that j = j’. Then J and 7' are R,-isomorphic by 
Lemma 1 on p. 205. Since R; = 0 = Rl for k #7’, it follows that J and 7' are R-isomorphic. 
Next assume j # j'. Then RJ = J and RJ = 0; hence / and 7' are not isomorphic as R-modules. O 


We also have a definitive result on irreducible representations for arbitrary (left) artinian 
rings. This is 


THEOREM 4.5. Let R be (left) artinian, N = rad R, and R = R/N = R,®*"®R, where the R; 
are the simple components. For each j,'\--i <8, let I; be a minimal left ideal of R;. Then I; is 


an irreducible module for R, li È 1r (as R-modules) if j + j', and any irreducible R-module is 
isomorphic to one of the I,. 


Proof. Any irreducible R-module M is annihilated by N, so it can be regarded as an 
irreducible R-module. Conversely, any irreducible R-module is an irreducible R-module. It is 
clear also that irreducible R-modules are isomorphic if and only if they are isomorphic as R- 
modules. Hence the theorem is an immediate consequence of Theorem 4.4. 0 


EXERCISES 


— 


. Ifz is a nilpotent element of a ring, then the index of nilpotency is the smallest t such that 
z = 0, z! 4 0. A similar definition applies to nilpotent ideals. Note that if N is a 
nilpotent ideal, then the indices of nilpotency of the elements of N are bounded. Use this 
to construct an example of a nil ideal that is not nilpotent. 


2. Prove that the center of any simple ring is a field. 

3. Show that if R = R:® ƏR, where the R; are simple, then the center C of R is © = C19" ®C, 
where C; = C N R; is the center of R; Hence C is a commutative semi-simple artinian 
ring. Note that the R; are determined by the simple components C; of C since R; = CR. 


4. Let R be left artinian, N = rad R, and assume that M = 0, N} + 0. Show that R/N and 
NIN *!, 1 <i<t-— 1, are completely reducible modules for R of finite dimensionality. 
Use this to prove that pR has a composition series and hence that R is left noetherian. 


The next four exercises are designed to give conditions for anti-isomorphism of simple 
artinian rings and to determine the anti-isomorphisms and involutions of such rings. 


5. Let V be a vector space over a division ring A that has an anti-automorphism/:6 ~ 6. A 
sesquilinear form on V (relative to j) is a map fof V x V into A such that 


F(X, txa y) =f (X,Y) +f (X,Y) 
F(X yi +y2) = f(x,y) +f (x, y2) 
f(dx,y) = òf (x,y), f(x,y) = f(x, y)ð 


for x,y € V, ô e€ A. Call f non-degenerate iff (zy) = 0 for all y implies z = 0 and f (x, z) 
= 0 for all x implies z = 0. Show that if V is finite dimensional and f is non-degenerate, 
then for a linear transformation / in V over A there exists a (left) adjoint relative to f 
defined to be a linear transformation /’ such that Kix, y) = fx, /'y) for all x,y. Show that / 
~ l'is an anti-isomorphism and that this is an involution if f is hermitian in the sense 
that fy,x) =f») for all x,y or anti-hermitian: f (y,x) = — fœ»). 


6. Let V and A be as in exercise 5 and let V* be the dual space hom,(V,4). This is a right 


vector space over A, hence a left vector space over A®, If/ is a linear transformation of 
V over A, let /* be the transposed linear transformation in V*. Show that / + /* is an 
anti-isomorphism of End,V onto End V*,. Hence show that if V,i = 1, 2, is a finite 


dimensional vector space over a division ring 4; then End, V, and End, V, are anti- 
isomorphic if and only if 4, and 4, are antiisomorphic and V} and V, have the same 
dimensionality. 

7. Let V be a finite dimensional vector space over a division ring 4 and assume End ,V has 
an anti-automorphism J. Note that J(/) ~ /* is an isomorphism of End,V onto End jop V* 


where V* is regarded in the usual way as left vector space over A? (ax* = x*a). Hence 
there exists a semi-linear isomorphism s of V over A onto V* over 4° such that 


* = sJ(I)s~! 
for l e End, V. Ifx, y € V, define 
f(x,y) = (sy) (x). 


Show that f is a non-degenerate sesquilinear form on V corresponding to the 
antiautomorphism o that is the isomorphism of A onto A? associated with s and that J is 
the adjoint map relative to f. 


8. (Continuation of exercise 7.) Show that if J is an involution (J? = 1), then either fis skew 
hermitian or f can be replaced by pf, p # 0, in A so that pf is hermitian and J is the 
adjoint map relative to pf (Sketch: Note that for any v € V, the map x ~ oY (v, x) is a 


linear function, so there exists a v'e V such that f(v,x) = £00), x e V. Show that for any 
u € V the map Lx ~ f(x,u)v is in End, V and its adjoint relative to fis y + o 'Av,y)u = 
fly.v'ju. Conclude from J7(/) = / that there exists a 5 # 0 in A such that f(y) = 40.) for 


all x,y € V. Show that 6 = 5! and if ô #— 1, then pf for p = 6 + 1 is hermitian and J is 
the adjoint map relative to pf.) 


9. Note that the Wedderburn-Artin theorem for simple rings has the following matrix form: 
Any simple artinian ring is isomorphic to a matrix ring M,(A'), A’ a division ring (A’ = 
A? if A is as in the statement of the theorem). Note that the isomorphism theorem implies 
that if A’; and A’, are division rings, then M7, ,(A’,) = M,,.(A’>) implies that n; = n, and A 
', = A’,. Show that the isomorphism theorem implies also that the automorphisms of 
MA‘) have the form A ~ S(°A)S! where S € M,(A’), o is an automorphism of A, and 
°A = (0(a,;)) for A = (aj). 


10. Formulate the results of exercises 6—8 in matrix form. 


1.5 STRUCTURE THEORY OF ALGEBRAS 


In this section we shall extend the structure theory of rings that we developed in sections 1—4 
to algebras over commutative rings. We shall then specialize to the case of finite dimensional 
algebras over a field. 

We recall that an algebra A over a commutative ring K is a (left) K-module and a ring such 
that the module 4 and the ring A have the same additive group (4, + ,0) and the following 
relations connecting the K-action and ring multiplication hold: 


(28) k(ab) = (ka)b = a(kb) 
for a, be A, ke K (p. 44). We have the ring homomorphism 
(29) gtk ~>kl 


of K into A (p. 138). The image under € is K1 = {KI |k e K}. This is contained in the center of 
A, and ka for k e K, a e A, coincides with (kl)a = a(k1). Conversely, given a ring R and a 
subring K of the center of R, R becomes a K- algebra by defining ka for k € K, a € R, to be the 
ring product ka. We remark also that rings are Z-algebras in which na for n € Z, a € R, is the 
nth multiple of a. Hence the theory of rings is a special case of that of algebras. 

The prime examples of K-algebras are the algebras EndgM where M is a K- module for the 
commutative ring K. Here kf for k e Kf € End,M, is defined by (Af) (x) = kf) =Akx), x€ M 


(p. 139). If A is an arbitrary K-algebra, then we define a representation of A to be a (K- 
algebra) homomorphism of A into an algebra End,M. If p is a representation of A acting in the 


K-module M, then we define ax = p(a)x, a € A, x € M, to make M an A-module as well as a K- 
module. Since p(ka) = kp(a), k € K, and p(a) is a K-endomorphism, we have 


(30) k(ax) = (ka)x = a(kx) 
for ke K,ae A,x € M. We now give the following 


DEFINITION 4.3. If A is an algebra over the commutative ring K, a (left) A- module is an 
abelian group written additively that is both a (left) K-module and a (left) A-module such 
that (30) holds for all ke K,a € A, xe M. 


If M is a module for A, we obtain a representation pj, of A by defining p ọy(a) for a € A to 
be the map x ~ ax. This is contained in End,M, and a ~ p,(a) is an algebra homomorphism. It 


follows from (30) that kx = (kDx. Hence, if M and N are modules for the algebra A, then any 
homomorphism f of M into N regarded as modules for the ring A is also a K-homomorphism. It 
is clear also that any A-submodule is also a K-submodule and this gives rise to a factor module 
MIN for the algebra A in the sense of Definition 4.3. We remark also that if M is a ring A- 
module, then M is a K-module in which kx = (k1)x, and (30) holds; hence, this is an algebra A- 
module. The algebra A itself is an A-module in which the action of A is left multiplication and 
the action of K is the one given in the definition of A. The submodules are the left ideals. These 
are the same as the left ideals of A as ring. If k e K and fe hom,(M,N) where M and N are 


modules for A, then kf is defined by (Af)(x) = kx), x e M. This is contained in hom ,(M, N), 
and this action of K on hom,(,N) is a module action. It is immediate that End ,M is an algebra 
over K. 


We can now carry over the definitions and results on the radical, primitivity, and semi- 
primitivity to algebras. An algebra is called primitive if it has a faithful irreducible 
representation, semi-primitive if there are enough irreducible representations to distinguish 
elements. The radical is defined to be the intersection of the kernels of the irreducible 
representations of the algebra. The results we proved for rings carry over word for word to 
algebras. We have observed that if A is an algebra over K, then the left ideals of A as ring are 
also left ideals of A as algebra and, of course, the converse holds. Since the radical of A as 
algebra or as ring 1s the intersection of the maximal left ideals, it is clear that rad A is the same 
set with the same addition and multiplication whether Æ is regarded as ring or as algebra. 
Similarly, A is primitive (semiprimitive) as algebra if and only if it is primitive (semi- 
primitive) as ring. 

The structure and representation theory of artinian rings carries over to algebras. In 
particular, the theory is applicable to algebras that are finite dimensional over fields, since any 
left ideal in such an algebra is a subspace and the descending chain condition for subspaces is 
satisfied. For the remainder of this section, except in some exercises, we consider this 
classical case: A a finite dimensional algebra over a field F. Then rad A is a nilpotent ideal 
containing every nil one-sided ideal of A. If rad A = 0, then A = 41®42®°*'®A, where the A, are 


ideals that are simple algebras. If Æ is simple, then A is isomorphic to the algebra of linear 


transformations in a finite dimensional vector space M over a division algebra A. Here M is 
any irreducible module for A and A is the division algebra End ,M given by Schur’s lemma. We 


recall also that M can be taken to be A/I where J is a maximal left ideal of A and A is anti- 
isomorphic to B/I where B is the idealizer of J (Proposition 4.7, p. 203). Hence A is a finite 
dimensional division algebra over F. Since End,M for M n-dimensional over A is anti- 


isomorphic to M,(A) and M,(A) is anti-isomorphic to M,(A°?), it is clear that A is a finite 
dimensional simple algebra over F if and only if it is isomorphic to an algebra M,(A'), A’ a 
finite dimensional division algebra. These results were proved by Wedderburn in 1908. 

In BAI, pp. 451-454, we determined the finite dimensional division algebras over 


algebraically closed, real closed, and finite fields. We showed that if F is algebraically 
closed, then F is the only finite dimensional division algebra over F and if F is real closed, 
then there are three possibilities for finite dimensional algebras over F: (1) F, (2) F(Y 7»), 
and (3) the quaternion algebra over F with base (1,i,/,k) such that i? = — 1 =? and ij = k =—i. 
If F is finite, then one has Wedderburn’s theorem that the finite dimensional algebras over F 
are fields. There is one of these for every finite dimensionality and any two of the same (finite) 
dimensionality are isomorphic (see BAI, pp. 287—290). Combining these results with the 
structure theorems, one obtains a complete determination of the finite dimensional semi-simple 
algebras over algebraically closed, real closed, or finite fields. 

The results on representations of artinian rings are also applicable to finite dimensional 
algebras over fields. Thus if A over F is semi-simple, then any representation of A is 
completely reducible and we have a 1—1 correspondence between the isomorphism classes of 
irreducible modules and the simple components of the algebra. 

One can apply the theory of algebras to the study of arbitrary sets of linear transformations 
in a finite dimensional vector space V over a field F. If S is sucha set, we let Env S denote the 
subalgebra of End; V generated by S. We call this the enveloping algebra of S. Evidently, Env 


S consists of the linear combinations of 1 and the products s,55...s,, S; E S, and the 
dimensionality dim Env S< n? where n = dim V. The injection map into End;V is a 


representation of Env S. The theory of algebras is applicable to the study of S via this 
representation. We shall now apply this method and the density theorem to obtain a classical 
theorem of the representation theory of groups. This is 


BURNSIDE’S THEOREM. Let G be a monoid of linear transformations in a finite 
dimensional vector space V over an algebraically closed field F that acts irreducibly on V 
in the sense that the only subspaces stabilized by G are V and 0. Then G contains a base for 
End; V over F. 


Proof. The hypothesis is that G contains | and G is closed under multiplication. Then Env G 
= FG, the set of F-linear combinations of the elements of G. Then V is an irreducible module 
for A = Env G and A'= End , V is a division algebra by Schur’s lemma. Evidently, A’ is the 


centralizer of A in Ende V, so A’ is a finite dimensional algebra over F. Since F is 
algebraically closed, we have A’= F1. By the density theorem, End} V = A. Hence A = FG = 
End;V. Evidently this implies that G contains a base for End-V. 0 


If S is an arbitrary set of linear transformations in V over F that acts irreducibly, then 
Burnside’s theorem can be applied to the submonoid G generated by S. The result one obtains 
is that there exists a base for End;V consisting of 1 and certain products of the elements of S. 


EXERCISES 


In these exercises A is an algebra over a field F. In exercises 1—5, A may be infinite 
dimensional; thereafter all vector spaces are finite dimensional. 


1. Leta € A and let a), ..., a,, be elements of F such that a — a,l is invertible in A. Show 


that either a is algebraic over F or the elements (a — a1) 4, 1 <i <m, are linearly 
independent over F. 


2. Let A be a division algebra with a countable base over an uncountable algebraically 
closed field F (e.g., F = €). Show that A = F. 


3. Let A be finitely generated over an uncountable algebraically closed field F and let M be 
an irreducible A-module. Show that End,M = F. 


4. Show that the elements of rad A are either nilpotent or transcendental over F. 
5. Let A be finitely generated over an uncountable field F. Show that rad A is a nil ideal. 


6. Assume F contains n distinct nth roots of 1 and let ¢ be a primitive one of these. Let s and 
t be the linear transformations of the vector space V/F with base (x1, x2, ..., X„) whose 


matrices relative to this base are respectively 


l 0 


Verify that 
f=al=f". st = ets, 


and that Env {s, t} = End, V. Let u and v be linear transformations of a vector space 


WIF such that u” = 1 = v”, uv = evu. Apply the representation theory of simple algebras 
to show that dim W =r dim V and there exists a base for W over F such that the matrices 
ofu and v relative to this base are respectively diag {0, 0,..., 0} and diag {t, T, ..., T}. 


7. Ifa is a linear transformation in a vector space V over F with base (x), ..., x,,) and ax; = 
»a,,x;, then the trace of a, tr a = Xa; is independent of the choice of the base (BAL p. 
196). Moreover, tv a is the sum of the characteristic roots of a. Let 


t(a.b) = trab. 
Show that (a,b) is a non-degenerate symmetric bilinear form on End; V (BAL, p. 346). 


8. Let G, V, F be as in Burnside’s theorem. Suppose tr G = {tr ala € G} is a finite set. 
Show that G is finite and |G] < |tr GI", n = dim V. (Hint: G contains a base (q),..., a„2) 
for End;V by Burnside’s theorem. Consider the map a ~ (t(a, a1), t(a, a,..., t(a, a„2)) 
where ¢(a, b) = tr ab as in exercise 7.) 

9. A linear transformation is called unipotent if it has the form 1 + z where z is nilpotent. 
Let G be a monoid of unipotent linear transformations in a finite dimensional vector 


space V over an algebraically closed field F. Prove Kolchins theorem that V has a base 
relative to which the matrix of every a € G has the form 


0 1 


(Hint: Let V= V1 > V2 D... D V, +1 = 0 be a composition series for V as A = FG- 


module and apply exercise 7 to the induced transformations in the irreducible modules 
V/V; . 1.) Extend the theorem to arbitrary base fields F. 


10. (Burnside.) Let G be a group of linear transformations in a finite dimensional vector 
space V over an algebraically closed field F such that there exists an integer m not 
divisible by the characteristic of F such that a” = 1 for all a e G. Prove that G is finite. 
(Hint: If G acts irreducibly, the theorem follows from exercise 8. Otherwise, we have a 
subspace U # 0, # V stabilized by G. Use induction on the dimensionality to conclude 
that the induced groups of transformations in U and in V/U are finite. Let K, and K, be 
the kernels of the homomorphisms of G onto the induced groups in U and V/U. Then K, 
and K, and hence K, N Ķ;, are of finite index in G. Finally, show that K, N K, = 1.) 


1.6 FINITE DIMENSIONAL CENTRAL SIMPLE ALGEBRAS 


An algebra A over a field F is called central if its center C= F1 = {alla € F}. If A is a finite 
dimensional simple algebra over A, then A = M,(A) where A is a finite dimensional division 


algebra over A. The center C of A is isomorphic to the center of A and hence C is a field (cf. 
exercise 2, p. 209). Now 4 can be regarded as an algebra over C. When this is done, A 


becomes finite dimensional central simple over C. As we shall see in this section and the next, 
the class of finite dimensional central simple algebras has some remarkable properties. To 
facilitate their derivation it will be useful to collect first some simple results on tensor 
products of modules and algebras over fields that will be needed in the main part of our 
discussion. 

We note first that since any F-module ( = vector space over F) is free, it is projective and 
thus flat (p. 154). Hence if we have an injective linear map i: V’ — V of F-modules, then i @ 1 
and 1 @i are injective maps of V'® U— V ® U and of U® V' into U® V respectively for any 
F-module U. If {xla € 7} is a base for V/F, then any element of U ® V can be written in one 
and only one way as a finite sum 24,,®*,,, 4,,€U, The fact that it is such a sum is clear since any 
element of U ® V has the form £4®6;4€U.5;€V¥. The uniqueness follows from the 
isomorphism of U@V = U@LFx = ®U®Fx, (Proposition 3.3, p. 131), which implies that if 
24, @x = 9, then every "s ® Xa = 9, Then the isomorphisms U@Fx, = U@F = U imply that a, ® 
Xa = 0 if and only if a, = 0. Similarly, if {yg € J} is a base for U, then every element of U ® 
V can be written in one and only one way as &!,®, 5€V, It is clear also that if {xzla € J} is a 
base for V and {ya} € J} is a base for U, then {yz @ xj € j,a € I} is a base for U ® V. 
Hence the dimensionality [U V: F] =[U: F][V: F]. 

If A and B are algebras over the field F, then A ® B contains the subalgebras 1 ® B = {1 ® 
bjb € B} and A @ 1 = {a @ lla e A}. These are isomorphic to B and A respectively under the 
maps b ~ 1®@banda-~ a®1. Hence we can identify B with the subalgebra 1 ® B and A with 
the subalgebra 4 @ 1 of 4® B. If we write a for a @ 1 and b for 1 ® b, then we have ab = ba 
and A ® B = AB = BA. 

We shall now prove the following result, which in effect gives internal characterizations of 
tensor products of algebras over a field. 


PROPOSITION 4.8. Jf A and B are subalgebras of an algebra D over a field F, then D =A 
®-B if the following conditions are satisfied: 


(1) ab=ba for alla € A,b € B. 
(2) There exists a base (x,) for A such that every element of D can be written in one 
and only one way as L)iX.,.%€ 8B, 
If D is finite dimensional, then condition (2) can be replaced by 
(2') D = AB and [D: F] = [A: F] [B: F]. 


Proof. The first condition implies that we have an algebra homomorphism of 4 ® B into D 
such that a ® b ~ ab, a € A, b € B (p. 144). The second condition insures that this is bijective, 
hence, an isomorphism. It is immediate that (2^) = (2) if D is finite dimensional. O 


As a first application of this criterion, we prove 


PROPOSITION 4.9. IfB is an algebra over F, then M,(B) = M,(F)®pB. 


Proof. This follows by applying the criterion to the subalgebras M,„(F) and B1 = {bl|b € B} 
where 


b 0 


If {e,|1 < i, 7 < n} is the usual base of matrix units for M„(F), then (b1)e; = e;(b1). Hence 
condition (1) holds. Moreover, (b1)e; is the matrix that has b in the (7,/)-position and 0’s 
elsewhere. This implies condition (2). O 


Next we prove 


PROPOSITION 4.10. M,,(F)®M,,(F) = M,,,(F). 


Proof. This follows by applying the criterion to the subalgebras consisting of the matrices 


where be M,(F), and of the matrices 


Gul, «+ > Gal, 
Riis o u Enn lm a,€ F, 
respectively. We leave the details to the reader. O 


If A is any algebra, we define A° = A ®,A’? and we call this the enveloping algebra of A. If 


A is a subalgebra of another algebra B, then we have a natural module action of A€ on B 
defined by 


(31) ().a;@D;)y = >a; yb; 


where the a; € A, b; e AP, y € B. It is clear from the universal property of the tensor product 
that we have a homomorphism of 4° into B such that 24'@4;~ Zayb; Hence (31) is well- 
defined. Direct verification shows that it is a module action. In particular, we have a module 
action of A° on A. The submodules of A as A-module are the ideals of A. Hence if A is simple, 
then A is A®-irreducible. We recall that the endomorphisms of 4 regarded as left (right) A- 


module in the natural way are the right (left) multiplications x ~ xa (x ~ ax). It follows that 
End ,eA is the set of maps that are both left and right multiplications x ~ cx = xd. Putting x = 1, 


we obtain d = c; hence End jeA is the set of maps x ~ cx, c in the center. If A is central, this is 


the set of maps x ~ (a1)x = ax, a € F. We shall now apply this to prove our first main theorem 
on finite dimensional central simple algebras. 


THEOREM 4.6. Jf A is a finite dimensional central simple algebra over a field F then 
A‘ = A®pA™ = MF) where n = dim A. 


Proof. We regard A as A°-module as above. Then A is irreducible and End,c4d = F. Also A is 
finite dimensional over F. Hence by the density theorem A° maps onto End;d. Since both A’ 


and EndpA have dimensionality n°, we have an isomorphism of A° onto EndpA. Since End, = 
M,,(P), the result follows. 0 


We consider next how a finite dimensional central simple algebra A sits in any algebra B 
(not necessarily finite dimensional) containing A as subalgebra. The result is 


THEOREM 4.7. Let A be a finite dimensional central simple subalgebra of an algebra B. 
Then B = A @,C where C is the centralizer of A in B. The map I ~ AI is a bijection of the set 


of ideals of C with the set of ideals of B. Moreover, the center of B coincides with the center 
of C. 


Proof. We regard B as A°-module as above. By the preceding theorem, A° is simple. Hence B 
is a direct sum of A®-irreducible modules all of which are isomorphic to A (since A is A®- 
irreducible and any two irreducible A°’-modules are isomorphic). Now observe that the 
generator 1 of A as A-module satisfies (a @ 1)1 = al = la = (1 ® a)l and (a @ 1)1 = 0 implies 
a = 0. Hence in any irreducible A°-module, we can choose a c such that (a ® 1)c = (1 @ ajc 
and (a ® 1)c = 0 implies a = 0. Applying this to B as A-module, we see that we can write B = 
@Ac, where ac, = ca for all a E A and ac, = 0 implies a = 0. Then c, € C and any element of 


B can be written in one and only one way as a finite sum )’a,c,, a, E AIf c e Cwe have c = 
Ža,Ca and ac = ca implies aa = aa,. It follows that every a, € Fl and c e $} Fc, This 
implies that C= J Fc, and {c,} is a base for C. It is now clear from Proposition 4.8 that A ® 
pC=B. 

Now let J be an ideal in C. Then Al is an ideal in B = AC. Moreover, AJ N C= I. For, if (x, 
=]; X5,..., X„) is a base for A, then the canonical isomorphism of A ®@,C onto B (p. 144) implies 
that any element of B can be written in one and only one way as Lieix, c€C, Then the elements 


of AI have the form }'dx,, d; € I, and if such an element is in C, it also has the form cx, c4 € 
C. It follows that C N Al is the set of elements dix; = d, € J. Thus C N Al = J, which implies 
that the map / + AI of the ideals of C into ideals of B is injective. To see that it is surjective, 
let /' be any ideal of B. Then 7' is a submodule of B as A°-module. Hence 7' = }Adp where dp € 
I=I' N C, which implies that /'= AI Thus the map J ~ Al is also surjective. 

It remains to prove that the center of B is the center of C. Since the center of B is contained 


in C, it is contained in the center of C. On the other hand, if c is contained in the center of C, it 
commutes with every element of B = AC and so it is in the center of B. O 


It follows from the preceding theorem that B is simple if and only if C is simple and B is 
central if and only if C is central. The theorem can be applied to tensor products in which one 
of the factors is finite dimensional central simple. Thus let B = A ®pC where A is finite 


dimensional central simple. We identify A and C with the corresponding subalgebras A ® 1 and 
1 ® CofA®C. Now we claim that C is the centralizer of A in B. For, let {yp} be a base for C 


over F. Then any element of B can be written in one and only one way as b = X agyp, ag E€ A, 
and ab = ba for a € A is equivalent to aag = aga for every a. This implies that b commutes 
with every a € A if and only if b = )iagy, ag E F, that is, if and only if b e C. We can now 
apply Theorem 4.7 to obtain the 


COROLLARY 1. Let A be a finite dimensional central simple algebra over F, C an 
arbitrary algebra over F. Then the map I ~ A @,l is a bijection of the set of ideals of C with 


the set of ideals of B = A ®;C. Moreover, the center of B coincides with the center of C 
(identified with the corresponding subalgebra of B). 


An important special case of this result is 


COROLLARY 2. Let A be a finite dimensional central simple algebra over F, C any 
algebra over F Then A @,C is simple if C is simple and A @,C is central if C is central. 


Iteration of Corollary 2 has the following consequence. 


COROLLARY 3. The tensor product A, @pA, @p...@p4,0f a finite number of finite 
dimensional central simple algebras is finite dimensional central simple. 


Of particular interest are tensor products in which one of the factors is an extension field E 
of the base field F. If A is any algebra over F, then A ®;E can be regarded as an algebra over 


E. This algebra over E is denoted as A, and is called the algebra obtained from A by 
extending the base field to E (see exercise 13, p. 148). If (x,) is a base for A over F, then by 
identifying A with A ® 1 in Ap we can say that (xa) is a base for A; over E. Hence the 
dimensionality of A over F is the same as that of A; over E. It follows also that if K is an 


extension field of Z, then (Ag) = Ax. If B is a second algebra over F with base (vg), then A ® 
FB has the base (x,,) (= (Xa ® yp) over F and this is also a base for (4 ®,B); over E and Az 
®,B, over E. It follows that (A ®,B); = Ag ®pBp. (More general results are indicated in 
exercise 13, p. 148.) 


If A is finite dimensional central simple over F, then Corollary 2 to Theorem 4.7 shows that 
A, is finite dimensional central simple over E. We recall also that M,,(F) is finite dimensional 


central simple over F. These are the simplest examples of central simple algebras. We now 
give 


DEFINITION 4.4. The matrix algebra M,(F) is called a split central simple algebra over 


F If A is finite dimensional central simple over F then an extension field E of F is called a 
splitting field for A if Aç is split over E (Ag = M,(E)for some n). 


Evidently any extension field of F is a splitting field of M„(F) and if E is a splitting field 
for A and B, then £ is a splitting field for the central simple algebra A @,B. This follows since 
Ag = M,(E£) and Bg = M„(E) imply (4 ®FB)g = Ap DB g = M, (E) ®,-M,,(E£) = M,,,(E). We note 
also that if E is a splitting field for A, then it is a splitting field for A”. For, (4P); = (Ap)? = 
M,(E)°?, and since M,,(£) has an anti-automorphism, M,(£)°? = M,(£). If A = M(B) where B 
is central simple over F then F is a splitting field for A if and only if it is a splitting field for B. 
We have A; = M,(Bp) since M(B) = M(F) @,B and M(B), = MF), Bg = M(B;). Now 
Bg = MA) where A is a central division algebra over E. Then A; = M..,(A). If E splits A (that 
is, is a splitting field for A) then A; = M,(£) and hence M,(£) = M,„(A) which implies that rs 
=n and A= E. Then By = M(E) and E splits B. The converse is clear from Proposition 4.10. 
In particular, if A = M@(A) where A is a central division algebra over F then F splits A if and 
only if E splits A. Thus it suffices to consider splitting fields for central division algebras. We 
remark finally that any extension field K of a splitting field is a splitting field. This is clear 
since (Ar) = Ag and M,(E)g = M, (4). 

If E is an algebraically closed extension field of F, then the only finite dimensional simple 
algebras over E are the algebras M,(E) (p. 213). Accordingly, any algebraically closed 
extension field £ of F is a splitting field for every finite dimensional central simple algebra 
over F. We shall see later that every field can be imbedded in an algebraically closed field 
(section 8.1). When this result is available, it proves the existence of splitting fields for finite 
dimensional central simple algebras. 

We shall now prove the existence of finite dimensional splitting fields for any finite 
dimensional central simple algebra A. Writing A = M,(A) where A is a finite dimensional 
central division algebra, we shall show that any maximal subfield of A is a splitting field. This 
will follow from the following result (which we shall improve in Theorem 4.12). 


THEOREM 4.8. Jf A is a finite dimensional central division algebra over F then a finite 
dimensional extension field E of F is a splitting field for A if and only if E is a subfield of an 
algebra A = M, (A) such that the centralizer C,(E) = E. 


Proof. Suppose the condition holds: F is a subfield of A= M,(A) such that C,(£) = E. We can 
identify A with End ,,V where V is an r-dimensional vector space over A’ = A. Then V is a A’ 
®,E module such that (d ® e)x = dex = edx ford € A’,e e ECA. Since A’ ®pE is simple, the 
representation of A’ ®pE in V is faithful and we can identify A’ ®pE with the corresponding 
ring of endomorphisms in V. Since A’ ®pE is finite dimensional simple over F, V is completely 
reducible as A’ ®,E module and so we can apply the density theorem. Now End,V = A so 
End, @ gV is the centralizer of E in A. By hypothesis, this is E. Now V is finite dimensional 


over F since it is finite dimensional over A’ and A’ is finite dimensional over F. Thus V is 
finite dimensional over E£. Hence, by the density theorem, A’ ®pE is isomorphic to the complete 


algebra of linear transformations in V over E. If the dimensionality [V : E] = n, then A’ @-F = 
M,(£). Thus £ is a splitting field for A’ over F and hence of A over F. 


Conversely, suppose A @-F = M,(£). Then also A’ ®@-E = M,(E). Let V be an irreducible 
module for A’ ®pE. Since A’ ®-F = M,(E), V is an n-dimensional vector space over E and A’ 
®-E can be identified with End;V. Also V is a vector space over A’ and if [V : A’] =r, then 
since E centralizes A’, E c End, V. The centralizer of E in Enda, V is contained in the 
centralizer of A’ @E in End;V. Since A’ E = End;V, this is E. Hence Cena, AZ) = E and 


Cuna) (E) = £. O 


The existence of a maximal subfield of A is clear since [A : F] < œ. We now have the 
following corollary, which proves the existence of finite dimensional splitting fields for any 
finite dimensional central simple algebra. 


COROLLARY. Let A be finite dimensional central simple over F and let A= M (4) where 
A is a division algebra. Then any maximal subfield E of A is a splitting field for A. 
Proof. It suffices to show that £E is a splitting field for A. Let E'= C,(£). Then E'> E and if E' 


+ E, we can choose c € E’ € E. The division algebra generated by E and c is commutative, 
hence this is a subfield of A properly containing E, contrary to the maximality of £. Thus C,(£) 


= E. Hence Theorem 4.8 with A = A shows that Æ is a splitting field. 0 


If E is a splitting field for A so that A, = M,,(E), then [A : F] = [Ar : E] = [M,(E) : E] = n°. 
Thus we see that the dimensionality of any central simple algebra is a square. The square root 
n of this dimensionality is called the degree of A. 


We shall prove next an important theorem on extension of homomorphisms: 


THEOREM 4.9. Let A be a simple subalgebra of a finite dimensional central simple 
algebra B. Then any homomorphism of A into B can be extended to an inner automorphism 
of B. 


Proof. We form the algebra E = A @,B°?, which is finite dimensional and simple. Any module 


for E is completely reducible and any two irreducible E-modules are isomorphic. It follows 
that any two E-modules of the same finite dimensionality over F are isomorphic. We now make 
B into an E-module in two ways. In the first, the action is (ia; ® b;)x = Ya,xb; and in the 
second it is Q4,@)x = 2 f(a)xb; where fis the given homomorphism of A into B. Clearly these 
are module actions and the two modules are isomorphic. Hence there exists a bijective linear 
transformation / of B over F such that 


(32) Xa;l(x)b; = (Xf (a;)xb;) 


for all a; € A, x,b; € B. In particular, /(x)b = I(xb) for all x,b € B. It follows that / has the form 
x~ dx where d is an invertible element of B. We have also al(x) = I(f(a)x) for alla € A, x € 
B. Then adx = df(a)x. Putting x = 1 we obtain f(a) = d ‘ad. Hence f can be extended to the 
inner automorphism b ~ d !bd in B. O 


If we take A = F1 in Theorem 4.9 we obtain the important 


COROLLARY (Skolem-Noether). Any automorphism of a finite dimensional central simple 
algebra is inner. 


We prove next an important double centralizer theorem for central simple algebras. 


THEOREM 4.10. Let A be a semi-simple subalgebra of a finite dimensional central simple 
algebra B. Then the double centralizer Cp(Cp(A)) = A. 


Proof. Consider the algebra B @B°P and its subalgebra A ®@B°?. If we identify B with B @ 1, 
then it is easily seen as in the proof of Theorem 4.7 that B N (A @B°?) = A. Now regard B as B 
® B°P-module in the usual way. By the proof of Theorem 4.6, the algebra B ® B? is isomorphic 
to the algebra of linear transformations of B over F of the form x ~ ){b;xb', where b,b; € B. 
Then the result that B N (A @B°P) = A implies that if a linear transformation of B is 
simultaneously of the form x ~ )’axb; where a; € A, b; € B, and of the form x ~ bx, b € B, 


then b =a e A. We now consider the algebra A ® B°?. We claim that this is semi-simple. To see 


this we write A = A, ® ... ® Ay where the A; are simple ideals. Then A ® B” = (A, @ B9) @ ... 
@(A, ® BP) and A; ® B? is an ideal in A ® BP. Since A; is simple and B” is finite dimensional 
central simple, A; ® BP is simple by Corollary 2, p. 219. Thus 4 ® B® is a direct sum of 
ideals that are simple algebras. Hence A ® B? is semi-simple. Now regard B as A ® B°- 
module by restricting the action of B® B? to the subalgebra A ® BP., Since A @ B? is semi- 
simple, B is completely reducible as E = A ® BP -module. Now consider £'= End,B. If we 
use the fact that Ihe endomorphisms of B as right B-module are the left multiplications, we see 
that E’ is the set of maps x ~ cx, c € C,(A). Since B is finite dimensional over F and B is 
completely reducible as A ® B°?-module, it follows from the density theorem that End,B is the 
set of maps * ~ £4)*b),a)€ A, bieB Now let c' € Cp(C,(A)). Then x ~ cx commutes with x ~ cx 
for c € Cp(A) and with x ~ xb for b € B. It follows that x ~ cx is in End, B. Hence x ~ cx 
also has the form x ~ )'axb;, a; € A, b; € B. We have seen that this implies that c'e A. Thus 
we have proved that Cp(Cp(A)) c A. Since A c Cp(C2(A)) is clear, we have Cp(Cp(A)) = A. O 


The foregoing result does not give us any information on the structures of the various 
algebras involved in the proof or on their relations. We shall now look at this question and for 
the sake of simplicity, we confine our attention to the most interesting case in which A is a 
simple subalgebra of the central simple algebra B. The extension to A semi-simple is readily 
made and will be indicated in the exercises. 


Assume A simple. Then £ = A ® BP is simple, so E = MA) where A is a division algebra. 
Let d=[A: F]. Then we have 


(33) [E:F] =[A:F][B:F] =r'd. 


If {e; |1 < Lj <r} is the usual set of matrix units for M,(A), then it is clear that A = e),M,(A)e;, 
which is just the set of matrices with (1, 1)-entry in A and other entries 0. Also, M,(A) is a 
direct sum of the r minimal left ideals M,(A)e; and these are isomorphic modules for the 
algebra M,(A). Each of these irreducible modules has dimensionality r7d/r = rd over F. Now 


consider B as E = A ® B°P-module. Since E is simple, B as E-module is a direct sum of, say, s 
irreducible submodules all of which are isomorphic to minimal left ideals of E. Since E = 
M (A), the dimensionality over F of any irreducible E-module is rd. Hence we have 


(34) [B:F] = srd. 
By (33) and (34) we see that s|r and 
(35) [A:F] = rfs. 


Now consider Cg(4) = E' = End;B. Since B is a direct sum of s irreducible E-modules and 


these are isomorphic to minimal left ideals Ee, e? = e, of E, we can determine the structure of 
End,B, hence of Cp(A), by using the following 


LEMMA. Let e be a non-zero idempotent in an algebra E, I = Ee, I® the direct sum of s 
copies of I. Then Endl) is isomorphic to M,((eEe)?). 


Proof. By Lemma 2, p. 198, Endl) consists of the maps (u4,..., Uy) + (Vis... Vs) where v; = 
Yja'ju; and a'j € Endgl. This gives a map (a'j) + n(a'j) of M,(Endg/) into End,/). Direct 
verification shows that this is an isomorphism. Next we consider End,/ = End;Ee. As in the 
proof of Lemma (1), p. 180, End,;Fe is the set of maps ae ~ aebe, a, b € E. In this way we 
obtain a map of eZe into End;Ee such that ebe ~ n(ebe):ae ~ aebe. Direct verification shows 


that this is an anti-isomorphism. Hence End,J = (eEe)°? and End; = M,((eEe)?). O 


Applying this lemma to B as E-module, we see that End-B = M,((e,,M(A)e,,)%) = 
M,(A®). Hence Cp(A) = M,(A°?) and 


(36) [C,(A): F] = s7d. 
Then 
(37) [A:F][Cg(A):F] = (r/s)s’d = rsd = [B: F]. 


We summarize these results in 


THEOREM 4.11. Let A be a simple subalgebra of a finite dimensional central simple 
algebra B. Then [B : f| = [A : F] [C3(A) : F] and A ® BP = M (A), A a division algebra, and 
Cg(4) = M,(A). Moreover, s|r, [B : F] = rsd where d=[A: F] and [A : F] = r's. 


The relation [B : F] = [A : F][C}(4) : F] shows that if A is a field, then Cp(A) = Ae [B : F] 


= [A : F]?. This gives the following precise determination of the finite dimensional splitting 
fields of a central division algebra. 


THEOREM 4.12. A finite dimensional extension field E of F is a splitting field for a 
central division algebra A if and only if [E : F| = nr where n is the degree of A, and E is 
isomorphic to a subfield of M (A). 


EXERCISES 


. Let A be a finite dimensional central division algebra over F, E a maximal subfield of A. 
Show that [A : F] = [E : FF. 

. (Proof of Wedderburn’s theorem on finite division rings.) Let A be a finite division ring, 
F the center of A so A is a central division algebra over F. Show that if £1 and E, are 
maximal subfields of A, then these are conjugates in the sense that there exists an s € A 
such that E, = sE,s '. Conclude from this that the multiplicative group A* of non-zero 
elements of A is a union of conjugates of the subgroup £,*. Hence conclude that A = £; = 
F is commutative (see exercise 7, p. 77 of BAI). 

. (Proof of Frobenius’ theorem on real division algebras.) Let A be a finite dimensional 
division algebra over R. If A is commutative, then either A = R or A = C. If A is not 
commutative, its center is either R or €; € is ruled out since it is algebraically closed 
and so has no finite dimensional division algebras over it. Now suppose A is central 
over R and A? R. Then A contains an element i such that i = — 1 and since € = R(i) has 
an automorphism such that i + — i, there exists a j in A such that ji = — ij. Show that the 
subalgebra generated by i and j is Hamilton’s quaternion algebra H. Use exercise 1 to 
prove that [A : R] =4 and that A = H. 


. Let E be a cyclic field over F, [E : F] =n, Gal E/F, the Galois group of E/F, = <s>. Let 
(E,s,y) be the subalgebra of M,,(£) generated by all of the diagonal matrices 


(38) b = diag {b, sb,...,s"" 1b}, 


b e E, and the matrix 


(39) z= 


where y is a non-zero element of F. Note that b ~ bis a monomorphism and identify b 
with $. Then we have the relations 


= CE 5 E T 
(40) ch=s(b)z, =} 
and every element of (E,s,y) can be written in one and only one way in the form 
(41) batbizt +b, 2" 


where the b; € E. Prove that (E, s, y) is central simple and [E, s, y) : F] =n’. 


10. 


11. 


12. 


. Let E, EG,s be as in exercise 4. Note that s e EndpE and for any b € E, x ~ bx is in 


EndpÆE. Identifying this with b, we have sb = s(b)s. Show that the algebra L of linear 


transformations in E over F generated by the multiplications b € E and s is isomorphic 
to the algebra (E, s, 1) of exercise 4. Hence conclude that L = End;F and (E, s, 1) = 


M,(F). 


. Prove that (E, s, y) = M,(F) if and only if y is the norm of an element of E. (Hint: Use 


Theorem 4.9, p. 222). Hence show that if n = p is a prime and y is not a norm in £E, then 
(E, s, y) is a division algebra. 


. Let E = Q(r) where r is a root of x? + x? — 2x — 1. Srhow that E/@ is cyclic (see BAI, p. 


236, exercise 1). Show that if y is an integer not divisible by 8, then y is not a norm of 
any element of Æ. Hence conclude from exercise 6 that (E, s, y) is a division algebra. 


. Let A be an algebra over F with base (w,..., u,,) and let a + p(a) be the regular matrix 


representation determined by this base. Put M(a) = det p(a) (see BAI, p. 403). Show that 
Ais a division algebra if and only if Ma) + 0 for every a + 0 in A. Use this criterion to 
show that if ¢ is an indeterminate, then Æ is a division algebra if and only if Ar is a 


division algebra. Generalize to show that if ¢),...,t, are indeterminates, then Arg., 4) iS 


a division algebra if and only if A is a division algebra. 


. Show that if A, and A, are finite dimensional division algebras over F and A, is central, 


then A, ®r A, = M(E) where £E is a division algebra and 7|[A; : F], i= 1, 2. Hence show 
that A, ® A, is a division algebra if ([A, : F], [A, : F) = 1. 


If A is a subalgebra of an algebra B, a linear map D of A into B is called a derivation of 
A into B if 

(42) D(a,a) = a, D(a) + D(a, )a, 

for a; e A. Form B®) and define 

(43) a(x, y) = (ax + D(a)y, ay) 


for a € A, (x,y) € B®. Verify that B® is an A-module with this action. Show that B®) is 
an A ® B°P-module in which 


a®b(x, y) = (axb + D(a)yb, ayb). 


A derivation of B into B is called a derivation in B. Ifd € B, the map x ~ [d, x] = dx — 
xd is a derivation in B called the inner derivation determined by d. Prove that if A is a 
semi-simple subalgebra of a finite dimensional central simple algebra B, then any 
derivation of A into B can be extended to an inner derivation of B. 


Let A be a semi-simple subalgebra of a finite dimensional central simple algebra B. 


Show that Cp(A) is semi-simple and investigate the relation between the structure of A, A 
® BP and CRA). 


1.7 THE BRAUER GROUP 


The results on tensor products of finite dimensional central simple algebras given in the 
previous section lead to the introduction of an important group that was first defined by R. 
Brauer in 1929. The elements of this group are similarity classes of finite dimensional central 
simple algebras. If A and B are such algebras, we say that Æ is similar to B (A ~ B) if there 
exist positive integers m and n such that M,,(A) = M(B) or, equivalently, M„(F) @ 4 = M,(F) 


® B. The relation of similarity is evidently reflexive and symmetric. It is also transitive since if 
M,,(F)@A = M,(F)@B 
and 
M,(F)@B = M,(F)@C, 
then 
My(F)@A = M,(F)@M,(F)@A = M,(F)@M,(F)@B 


~ M,(F)®M,(F)@B = M,(F)@M,(F)@C = M,,(F)@C. 


Note that we have used associativity and commutativity of tensor multiplication as well as the 
formula M,(F) @ M,,(F) = M,,,(F), which was established in Proposition 4.10, p. 217. We 


now see that ~ is an equivalence relation. Let [A] denote the similarity class of A, that is, the 
equivalence class of finite dimensional central simple algebras similar to A. Suppose A ~ A' 


and B ~ B' so we have positive integers m, m', n, n' such that A ® M,(F) = A'® M,,(F) and B ® 
M(F) = B'® M„ (F). Then A ® B® M,a (F) = A' 8 B'® Mpm (F). Hence A 8B ~ A' @ B'. 
Thus we have a well-defined binary composition of similarity classes given by 


(44) [A][B] = [4 @ B] 

Evidently this is associative and commutative. Moreover, the set of matrix algebras M,(F) 
constitutes a similarity class 1 and this acts as unit since A ~ A ® M,(F). We have seen in 
Theorem 4.6, p. 218, that A ® A? ~ 1. Hence [A][A°?] = 1 = [AP [A]. 


If A is finite dimensional central simple over F, we can write A = Mn(F) @ A for A, a finite 
dimensional central division algebra. Conversely, if A is as indicated, then M,(F) @ A is finite 


dimensional central simple over F. We recall that the isomorphism theorem for simple artinian 
rings (p. 206) implies that if M,(A) = M„(A’) for division algebras A and A’, then m = n and A 


= A’. It follows that the division algebra A in the formula A = M,(F) ® A is determined up to 


isomorphism by A. Hence a similarity class [A] contains a single isomorphism class of finite 
dimensional central division algebras and distinct similarity classes are associated with non- 
isomorphic division algebras. Now the class of subalgebras of the matrix algebras M„(F), n = 


1, 2, 3,... is a set and every algebra over F is isomorphic to a member of this set. Hence the 
isomorphism classes of algebras over F constitute a set. This set has as subset the isomorphism 
classes of central division algebras over F. Hence the similarity classes of central simple 
algebras over F is a set. Our results show that this set is a group under the composition defined 
by (44). This group is denoted as Br(F) and is called the Brauer group of the field F. It is 
clear that the determination of Br(F) for a field F is equivalent to a complete classification of 
the finite dimensional central division algebras over F. 

If F is algebraically closed or is a finite field, then Br(F’) = 1. If F = R, then Br(F) is a 
cyclic group of order two by Frobenius’ theorem. We shall determine Br(F) for F a p-adic 
field in Chapter 9 (section 9.14). One of the most important achievements of algebra and 
number theory in the 1930’s was the determination of Br(Q) and, more generally, Br(F) for F a 
number field (that is, a finite dimensional extension field of @). This involves some deep 
arithmetic results. 

Let E be an extension field of F. If A is finite dimensional central simple over F, then A; is 
finite dimensional central simple over E. We have (4 ®pB)g = Ag ® Bg and M, (F)g = M,(£). 
This implies that we have a homomorphism of Br(F) into Br(E) sending [A] into [Az]. The 
kernel, which we shall denote as Br(E/F) is the set of classes [A] such that A, ~ 1 (1 for F is 


customary here), that is, the [A] such that A is split by E. We shall consider this group in 
Chapter 8 for the case in which F is a finite dimensional Galois extension field of F. 


EXERCISE 


1. Use Theorem 4.11 (p. 224) to show that if 4 is finite dimensional central simple over F 
and EF is a subfield of A/F, then C,(£) ~ Aç (in Br(£)). 


1.8 CLIFFORD ALGEBRAS 


In this section we apply some of the results on central simple algebras to the study of certain 
algebras—the Clifford algebras—defined by quadratic forms. These algebras play an 
important role in the study of quadratic forms and orthogonal groups. The results on these 
matters that we require can be found in BAI, Chapter 6. 

Let V be a finite dimensional vector space over a field F equipped with a quadratic form Q. 
We recall the definition: Q is a map of V into the base field F such that 


1. O(ax) =0e?O(x), a € Fixe V. 


2. B(x, y) = O(x + y)—O(x)—O(y) is bilinear. 


Evidently, B(x, y) is symmetric, that is, B(y,x) = B(x, y) and B(x, x) = 2O(x). We use Q to 
define an algebra in the following manner. 


DEFINITION 4.5. Let V be a vector space over a field E Ọ a quadratic form on V. Let 
T(V) = FOV@VOVI@VO@VOV)I®-~ be the tensor algebra defined by V (p. 140) and let Ko be 
the ideal in T(V) generated by all of the elements of the form 


(45) x®@x-Q(x)l, xer. 
Then we define the Clifford algebra of the quadratic form Q to be the algebra 
C(V,Q) = T(V)/Ko. 


Ifa e T(V), we write = 4+KeeC(V.Q) and we have the map i: x ~ x of V into C(V,Q). 
Since V generates T(V), i(V) generates the Clifford algebra. We have x* = Q(x)I. Moreover, we 
claim that we have the following universality of the map 7: If fis a linear map of V into an 
algebra A such that 


f(x =Q(x)l, xev, 


then there exists a unique algebra homomorphism g of C(V;Q) into A such that 


C(V, Q) 


(46) 


is commutative. To see this, we recall the basic property of the tensor algebra that any linear 
map f of V into an algebra has a unique extension to an algebra homomorphism f' of T(V) into A 
(see p. 140). Now for the given f, the kernel, ker /’, contains every element x ® x — O(x)l, x € 
V, since (x ® x—Q()1) = F(x) —Q0)! = 0, Hence the ideal Kg C ker f and so we have the induced 
homomorphism 9'4+Kg > f'(a). In particular, g(x) = f (x) = f(x), which is the commutativity of 
(46). The uniqueness of g is clear since the ¥ generate C(V,Q). 

The ideal Kg defining C(V;Q) contains every 


(x+y)@(x+y)—O(x+y)l = x@x+y@xt+x@yt+y@y 
—Q(x)1-B(x,y1-Qy)l, x,yeV. 


Since *®@x—Q(x)I and y®y—Q(y)l€ Ko we see that 

(47) X®y+y@x-Bix,yleKy, x,yeV. 
Equivalently, we have the relations 

(47') xp+yx = B(x, y)1 


in C(V.O) as well as x? = O(x)1. We can use these to prove 
LEMMA 1. /fthe elements u, u,..., u, span the vector space V over E then the elements 


(48) i=l gS" 1,<1,<° <4, I<ren 


span the vector space C(V,Q) over F. 


Proof. Since T(V) is generated by V and the u; span V, it is clear that the u; generate T(V), so 
every element of T(V) is a linear combination of 1 and monomials in the u; of positive degree. 


We now call 1 and the monomials "t't, with i < iz <> <in 1 Sr <n, standard. Let S be the 
set of these. We proceed to prove by induction on i and on the degree of u e S that uju is 


congruent modulo the ideal Ko to a linear combination of standard monomials of degree <deg 
u + 1. This is clear if deg u = 0. Now suppose u = upupn ... uj, r = 1. If i = 1, the result is 
clear if i, > 1 and if i; = 1, then "1" = u°," u, = Qiu," (mod Ko). Now leti > 1. Ifi <i 
the result follows as in the case i = 1. Hence assume i > i,. Then, by (47) 


Uj; Uj, Uj, = — U; Uii, * U;, + Blu;, u; )u;,*** u;, (mod Ko). 


By the degree induction, ujuj ... u 
standard monomials of degree <r. Then induction on the subscript 7 implies that uju; ... u 


ir is congruent modulo Kg to a linear combination of 
ir 
and hence ujujyuj. ... uj, is congruent modulo Kg to a linear combination of standard 
monomials of degree < r + 1. The result we have proved implies that if C’ is the subspace of 
C(V, Q) spanned by the elements $ Ai, ü, i1 < i2 <+ <i, then &C' c C’, This implies that C'C' 
c C' so C' is a subalgebra of C(V,Q). Since C’ contains P = {x|x € V} and P generates C(V, 


Q), we have C'= C(V, Q). D 


Evidently, the lemma implies that if dim V = n, then dim C(V, Q) < 2”, which is the number 
of standard monomials in the u;. 


For the sake of simplicity, we assume in the remainder of the text of this section that char F 


+ 2. The extension of the results to the characteristic two case will be indicated in the 
exercises. We assume first that B is non-degenerate and we separate off the lowest dimensional 
cases n = 1, 2. 

n= 1. We have dim C(V, Q) < 2 and if V = Fu, then u? = O(u)1 and O(u) £ 0, since B is 
non-degenerate. Let A = FT f]/(t? — O(u)1) where t is an indeterminate. Then A has the base (1, 
T) where ¢ = ¢ + (£ — Q(u) 1). Then #7 = Q(u) 1, which implies that the linear map fof V into A 
such that Au) = £ satisfies Ax)? = O(x)T. Hence we have a homomorphism of C(V, Q) into A 
such that ü ~t. This is surjective. Since dim A = 2 and dim C(V,Q) < 2, we have dim C(V,Q) = 


2 and our homomorphism is an isomorphism. If Q(u) is not a square in F, then £ — O(u) 1 is 
irreducible in F[¢] and C(V, Q) is a field. If O(u) = p°, p € F, then” ="%—F! satisfies 


e? =R —2fu+f71 = 20(u)l —2pi 
—2R(u—B1) = —2fe'. 


Hence e = —(2f) 'e’is an idempotent + 0, 1 in C(V,Q). Then C(V, Q) = Fe ® F(1 — e), a direct 
sum of two copies of F. Hence we have the following lemma. 


LEMMA 2. Jf B(x, y) ,is non-degenerate and n = 1, then dim C(V, Q) = 2 and if V = Fu, 
then C(V, Q) is afield or a direct sum of two copies of F according as Q(u) is not or is a 
square in F. 


n = 2. Choose an orthogonal base (u,v) for V. Then O(u)Q(v) + 0. Let C’ be the Clifford 


algebra of Fu relative to the restriction Q' of O to Fu. Then C' has a base (1, #) where # = 
Q(u) 1. Now consider the matrices 


a 0 70 D 
v=o a) = (ou 4 


in M,(C’). We have 


z= | () u op f H — tu) 
uy’ = ), v'u =| 
—Otvju 0 O(vu 0 


so u'v'+ v'u'=0. Also u? = O(u) 1, v? = O(v) 1. It follows that A= F1 + Fu'+ Fv'+ Fu'v'isa 
subalgebra of /,(C’). It is clear from the form of the matrices that 1, u' v’, u'v' are linearly 
independent. Hence dim A = 4. The relations on u' and v' imply that the linear map f of V into A 
such that u ~ u', v ~ v’ satisfies Ax)? = Q(x) 1, x € V. Hence we have a homomorphism g of 
C(V, Q) into A such that g(#) = u’, (6) = v’. Then g is surjective and since dim C(V, Q)<4 and 
dim A = 4, it follows that g is an isomorphism and dim C(V, Q) = 4. 


An algebra A over a field F of characteristic #2 is called a (generalized) quaternion 


algebra if A is generated by two elements i and j such that 
(49) ?=0140, f=fl40, ij=-ji 


where a, P e F. We denote this algebra as (a, p) or (a, P/F if we need to call attention to the 
base field F. We now prove 


LEMMA 3. Any quaternion algebra is four-dimensional central simple over F. 


Proof. The relations (49) imply that any element of A is a linear combination of the elements 
1, i, 7, ij. Now suppose Al + ui + vj + pij = 0 for A, u, v, p € F. If we multiply this relation on 
the left by i and on the right by 7! = ai, we obtain Al + ui — vj — pij = 0. Then Al + ui = 0 
and vj + pij = 0. Multiplication of these on the left by j and on the right by 7! = £~! then gives 
Al — wi = 0 and vj — pij = 0. Then Al = wi = vj = pij = 0 and A = u = v = p = O. Hence the 
elements 1, i, j, ij are linearly independent and so these constitute a base for A. Hence dim.4 = 
4. Now let J be an ideal 4 A in A and let i = 1+ 77 =i+/,/ =j + 1. Then 
i? = al # 0, J? = pT #0, ij= —ji.. Hence A/I is a quaternion algebra and so dim A/I = 4. Then J = 
0 and hence A is simple. It remains to show that the center of A is F1. Let ¢ = 41 +pi+vj+pije 
center of A. Then ci = ic implies that v = p = 0. The fact that 21 + ui commutes with j implies 
that u = 0. Hence c = A1 and the center is F1. 0 


The result we obtained before on C(V, Q) in the case n = 2 can now be stated in the 
following way. 


LEMMA 4. /f B(x, y) is non-degenerate and n = 2, then C(V, Q) is a quaternion algebra. 


Proof. We had an isomorphism of C(V, Q) with the algebra A generated by wu’ and v’ such that 
u? = Q(u) 1 £0, v? = O(v”) 1 + 0, and u'v'=-— v'u'. Then A and, hence, C(V, Q) are quaternion 
algebras. O 


We recall that if B(x, y) is a symmetric bilinear form on a vector space V and (u4, u,..., Up) 
is a base for V, then ô= det (B(u, u;)) is called a discriminant of B. A change of base replaces 
ô by of? where £ + 0 is the determinant of the matrix giving the change of base. B is non- 
degenerate if and only if ô # 0. We recall also that if U is a subspace of V on which the 
restriction of B to U is non-degenerate, then V= U® U1. Moreover, the restriction of B to Us 
is nondegenerate. 


We shall now prove the following factorization property. 


LEMMA 5. Let B(x, y) be non-degenerate and dim V > 3. Let U be a twodimensional 
subspace of V on which the restriction of B to U is non-degenerate. Write V = U® Us and 


let O' and Q" denote the restrictions of Q to U and Us respectively. Then 
(50) C(V,Q) = C(U,Q')@C(U*, -5'Q") 


where ô' is a discriminant of the restriction of B to U. 


Proof. We shall prove (50) by using two universal map properties to produce inverse 
isomorphisms between C(V, Q) and the right-hand side of (50). We denote the canonical maps 
of U into C(U, Q’) and of Us into C(Us, — 6'0") as i': y ~ y' and i': z + z" respectively. Let 
(u, v) be an orthogonal base for U and put d=2m (in C(V, Q)). We have 
ü? = Q(u)l, ö = Q(v)l, w= —öü şo = dd? = —4Q(u)Q(v)1 = — Blu, u)B{v, v)1 (since B(x, x) = 2Q(x)) = —ô'1 
where 0’ 1s the discriminant of the restriction of B to U defined by the base (u, v)). Also if y € 
Uandz €e Us, then Jd = —djy, yZ = —27, and dz = Zd; hence 


(51) yd) = (@2)p, 2)? = -8'0(2)1. 


Since y? = Q(y) 1 and (2) = —%'Q(2)l, the universal map property of Clifford algebras implies 
that we have homomorphisms of C(U, Q^ and C(U:, — 6'Q”) into C(V, Q) sending 
y'~y, 2° ~d2 respectively. The elements ¥ and dz generate the images under the 
homomorphisms and, by (51), these elements commute. Hence the images under our 
homomorphisms centralize each other, so by the universal map property of tensor products, we 
have a homomorphism / of C(U,2)@C(U*, —%'Q") into C(V, O) such that 


(52) yoy, z'ĘŁdz3}_ yeU,zeU’. 


Now consider the element d'= 2u'v' in C(U, Q’). The calculations made before show that d 


' = —y'd' and d? = -6' 1, so d' is invertible in C(U, QO’). Next consider the element 
y +d '2" of C(U,Q)@C(U*, — 0'0”), We have 


(y +d 12")? = y? + (yd! d'y)" 4-2? 
= y? +2"? = O(y)l+Q(z)l = O(y+z)1. 


Hence by the universal map property of C(V, Q) we have a homomorphism g of C(V, Q) into 
C(U,Q')@C(U*, — 8'0") such that 


(53) +> y +d!" 


Checking on generators, by (52) and (53), we see that gh = 1 on C(U.Q)@C(U*, = 8'0") and hg = 
1 on C(V, O). Hence C(V, Q) = CCU. Q9@C(U*, -8'Q"), D 


We are now ready to prove the main theorem on the structure of Clifford algebras. 


THEOREM 4.13. Let Q be a quadratic form of an n-dimensional vector space over afield 
F of characteristic #2. Then 


(1) dim C(V, Q) = 2” and if (uy, U,..., u,) is a base for V, the elements 
I> “2 n 


(54) EERTE TEAN <a Lig gn 


b l| 
constitute a base for C(V, Q) over F. 
(2) The canonical map i : x ~ X of V into C(V, Q) is injective. 
(3) If the bilinear form B associated with Q is non-degenerate, then C(V, Q) is central 
simple if n is even and ifn = 2v + 1, v e Z, then C(V, Q) is either simple with a two- 


dimensional field as center or is a direct sum of two isomorphic central simple 
algebras according as (—1)' 20, ô a discriminant of B, is not or is a square in F. 


Proof. We note first that the second statement in (1) and statement (2) are consequences of the 
dimensionality relation dim C(V, Q) = 2”. For, by Lemma 1, the elements in (54) span C(V, Q). 
Since the number of these elements is < 2”, if dim C(V, Q) = 2”, their number is 2” (that is, they 
are distinct) and they form a base. This implies also that the ñ, 1 < i < n, are linearly 
independent and hence the linear map 7 : x ~ ¥ is injective. 

Next we prove the dimensionality relation and (3) in the non-degenerate case. If n = 1, the 
results follow from Lemma 2 and if n = 2, they follow from Lemmas 3 and 4. Now assume n > 
2. Then we can pick a two-dimensional subspace U on which the restriction of B is non- 
degenerate. Then V = U® U: and the restriction of B to Us is non-degenerate. By Lemma 5, 
C(V, O) = C(U, Q0) ® C(Uz, — ô'Q") where 0’ is a discriminant of the restriction of B to U. 
Moreover, C(U, Q’) is a quaternion algebra, hence, is fourdimensional central simple. Using 
induction on the dimensionality, we may assume the results for the quadratic form —ô'Q" on U4. 
Then dim C(U:, — 6'0") = 2” ~ and this algebra is central simple if n — 2 is even. Hence dim 
C(V, Q) = 272" 7? = 2", and by Corollary 2 to Theorem 4.7, p. 219, this algebra is central 
simple ifn is even. Ifn =2v+1,n—2=2(v-— 1) + 1 and the induction hypothesis implies that 
if ô” is a discriminant of the restriction of B to Us, then C(U:, — 6'Q") is simple with two- 
dimensional center or is a direct sum of two isomorphic central simple algebras according as 
(—1)’~ ! 2( — 6” ~2 6" is not or is a square in F. Accordingly, C(V, Q) is simple with two- 
dimensional center or is a direct sum of two isomorphic central simple algebras according as 
(—1)”~ ! 2( -— 8)" ~2 6" is not or is a square. Now 


(—1)'~12(- 6)" 25" = (= 1) (6) -328'0" 
= (—1)'(5')"" 326 
and since n — 3 is even this is a square if and only if (— 1)” 26 is a square. This proves (3). 


It remains to prove the dimensionality formula in the degenerate case. For this purpose, we 
shall imbed V in a finite dimensional space W with a quadratic form that has a non-degenerate 


associated bilinear form and is an extension of Q. To do this we write V = Vs ® U for some 
subspace U. Then the restriction of B to U is non-degenerate. Now put W = Vi ® U @ (V21)* 
where (Vz)* is the space of linear functions on Vs. Letx =z +y +fwhereze Vz ye U, and f 
e (V.)* and define 


(55) O(x) = O(z)+Q(y) +f (2). 


It is readily seen that @ is a quadratic form on W whose associated symmetric bilinear form B 
is non-degenerate. Let x + ¥ be the canonical map of W into C(W, @). It follows from the 
universal property of C(V, Q) that we have a homomorphism of C(V, Q) into C(W, @) such that 
¥ œ Xforxe V. Let (uj,..., Up Un + 15-++5 Ug) be a base for W such that (uj,..., u,) is a base for 
V. Since B is non-degenerate, the elements “Mir 41 <+ <J» 1 S84) are distinct and linearly 
independent. Then this holds also for the elements %.° T, ii <5 <4 1<r<n, Since the 


~ 


homomorphism of C(V, Q) into C(W, @) maps ü; ... &,. into a, ... ñ, the elements ñ; ... ü, i, 
<...<i,, 1<r<n, are linearly independent. Hence dim C(V, Q) = 2”. 0 


Since the map i: x + *¥ of V into C(V, Q) is injective, we can identify V with the 
corresponding subspace of C(V, Q). Hence from now on we assume V c C(V, Q). If U is a 
subspace of V, then the subalgebra of C(V, Q) generated by U can be identified with C(U, Q’) 
where Q' is the restriction of Q to U. This is clear from the last part of the proof of Theorem 
4.13. For, if (u)...u;,) is a base for U over F, then the argument shows that the elements 1, u,,, 


-Uj Í 1<i2<...<i,,1<r<m, are linearly independent and this implies that the canonical 
homomorphism of C(U, Q') into C(V, Q) is a monomorphism. 
It is clear from the definitions that if O = 0, then C(V, Q) is the exterior algebra E(V) 
defined by V (p. 141). The results (1) and (2) of Theorem 4.13 give another proof of properties 
of E(V) that were derived in BAI, pp. 411-414. 


We remark finally that the proof of statement (3) in Theorem 4.13 yields a stronger result 
than we stated in this theorem. We state this as the following 


COROLLARY. Let Q be a quadratic form on an n-dimensional vector space V over a field 
F of characteristic not two such that the associated bilinear form is non-degenerate. Then 
C(V, Q) is a tensor product of quaternion algebras if n is even and is a tensor product of 
quaternion algebras and its center if n is odd. Moreover, the center C is two-dimensional of 


the form F(c) where c* = (— 1 Y 2~"61, 6 a discriminant, and C is afield or a direct sum of 
two copies of F according as (— 1)’(20) is not or is a square in F. 


Proof. The first statement follows by induction on the dimensionality and the factorization 
lemma (Lemma 5). To determine the center in the odd dimensional case, we choose an 


orthogonal base (u4, U,..., u,,) where n = 2v + 1. Then Uj; = — ujui for i #7, which implies 


that the element c = uu ... u, commutes with every u; Hence c is in the center and since c É F 


1 and the center is two-dimensional, the center is F[c]. We have 
~1) 
C? = uju," u, uau u, = (— 1)” Niy Fy tawny 2 


(56) n : 
=(—1) [] Q(u,)1 = (-1)'27"61 


where ô is the discriminant determined by the base (uw, u,..., u,,). Then F[c] is a field or a 


direct sum of two copies of F according as (— 1)”2° "6 is not or is a square. Since n = 2v + 1, 
this holds if and only if (— 1)’26 is not or is a square. 0 


In the remainder of this section, we shall give a brief indication of some applications of 
Clifford algebras to the study of orthogonal groups. For this purpose, we need to introduce the 
even (or second) Clifford algebra C*(V, Q) defined to be the subalgebra of C(V, Q) generated 


by all of the products uv, u,v € V (that is, by V*). We recall that a vector u is called non- 
isotropic if O(u) # 0. Ifis non-isotropic, then 


(u,u) (uv) = uy(—uyut Blu, u,)l)vo = —Q(u,)uv+ B(u, u, uzv. 
Hence 
uv = Q(u,) *(B(u, u, )uyv—(u,u) (uyv)), 


which shows that C* = C* (V, Q) is generated by the elements uju, u € V. Now we can write V 
= Fu, + (Fu,)s and u = au, + v where a € F and v L u; Then uju = aQ(u,)l + uv. It follows 


that CT is generated by the n— 1 dimensional subspace V; = u,(Fu,)1. We have 


(57) (u,v)? = -u,v = —Q(u,)Q(v)! 


and the restriction of — QO(u,)O to (Fu) is a quadratic form Q, with nondegenerate bilinear 
form B,. Hence we have a surjective homomorphism of C((Fu,)s, Q,) onto C*. On the other 
hand, if (uw 1,..., u,) is a base for V, then 1, Uj + Up, i <<... <4, is a base for C(V, Q). Then the 


elements 1 and u,,... u; with even r are contained in C* and there are 2”~ ! of these. Thus dim 


C*> 2” l, while dim C((Fu;)s, Q,) = 2” ~ 1. It follows that Ct = C((Fu,)2, Q,). This proves 
the first statement in 


THEOREM 4.14. Let B be non-degenerate and char F + 2. Then the even Clifford algebras 
C* (V, Q) = C((Fu,)2, Q)) where u; is any non-isotropic vector and Q} is the restriction of 
— Q(u,)O to F (u,)s. C* (V, Q) is central simple if the dimensionality n of V is odd and is a 
tensor product of a central simple algebra and a two-dimensional algebra D, which is either 
a field or a direct sum of two copies of F if n = 2v. The two alternatives for D correspond 


respectively to the following: (— 1)’6 is not or is a square in F where ô is a discriminant of 
B}. 


Proof. The second assertion is an immediate consequence of the first and Theorem 4.13. Now 
assume n is even and let c = uju) ... u,, Where (uy, U>,..., u,,) is an orthogonal base for V. Then 


ce C™ and ujc = —cu; and uju;C = cuju;. Hence c is in the center of C™ and c É F1. Hence the 
center of C * is F[c]. As in (56) we have 


c? = (—1)' p61 


where £ = 2’ and F{c] is a field or a direct sum of two copies of F according as (— 1)” f76 
and hence (— 1)”ô is not or is a square. O 


In both the even and the odd dimensional case, the subspace Fc, where c = uyw ... u,, and 
(uj, U>,..., U,) is an orthogonal base, is independent of the choice of this base. For F[c] is 


either the center of C(V, Q) or the center of C(V, Q) *. Moreover, c É F1 and c? e F1 It is 
clear that these conditions characterize the set of non-zero elements of Fc. 

Now let f be an orthogonal transformation in V. Then f(x)? = O(f(x)) 1 = Q(x) 1. Hence the 
universal map property of C(V, Q) implies that f has a unique extension to an automorphism g 
of C(V,Q). Now g stabilizes C(V, Q)" and it stabilizes the center of C(V, Q) and of C(V, Q`. 
Since one of these centers is F[c], g stabilizes F[c]. It follows from the characterization we 
have given of the set of non-zero elements of Fc that g(c) = ac, a # 0 in F. Since g(c)* = g(c’), 


we have a = + 1 so g(c) = +c. Now we can write g(u;) = flu,;) = Laju;, aj; € F, where the 


matrix (aij) is orthogonal. Then 


glc) = f (Uy) f (Ua) f (Up) = Looe j, aj, Anj lj My Uyy 


Since uju; = — uju; if i #7, and u? = Q(u,)1, it is clear from the definition of determinants that 
the sum can be written as det(a)ujuz ... u,+ a linear combination of elements u; u;, ... Uj» i} < 
in <... < i, with r < n (cf. BAL p.416). Since these elements together with ujup ... u, 
constitute a base and since g(c) = +c, we have g(c) = det(a;7)c. Hence g(c) =c if fis a rotation 
and g(c) =— c otherwise. 

We now observe that any automorphism g of a finite dimensional semisimple algebra A that 
fixes the elements of the center C of A is inner. For, if 4 = 419429 ®A, where the 4; are the 


simple components of A, then 1 = 1, + 1,+ ... + 1, where 1; is the unit of 4; and 
A= Al, @A1,0°"@Al, and C = Cli@Cl,@°@Cl,, Hence g(I,) = 1; so g stabilizes every A; and g 
fixes the elements of the center Cl; of 4; By the Skolem-Noether theorem, there exists an 
element u; invertible in A; such that the restriction of g to A; is the inner automorphism 
determined by u;. Then g is the inner automorphism determined by u = J u;. 


We can apply this to the foregoing situation. Then we see that if the given orthogonal 
transformation f is a rotation, there exists an invertible element u e C(V, Q) such that 


a Li Ce Ps | D 7 
(58) f(x)=uxu™, xevV. 
These considerations lead to the introduction of the following groups. 


DEFINITION 4.6. The Clifford group I (V, Q) is the subgroup of invertible elements u € 
C(V, Q) such that uxu € V for all x e V. Clearly this is a subgroup of the multiplicative 
group of invertible elements of C(V, Q). The even Clifford group is r * (V, Q) =T(V, QO) N C 


+U, Q). 


Ifxe Vandu € T =T(V, Q), then uxu ! €e V and (uxw ')? = ux2u” != Q(x)l. Hence the 
linear transformation x ~ uxu ! of V is in the orthogonal group 0(V, Q). The map %, where x(u) 
is x ~ uxu “l, x € V, is a homomorphism of T(V, Q) into 0(V, Q) called the vector 
representation of the Clifford group. 


Letv e V be non-isotropic. Then v is invertible in C(V, Q) and for x e V we have 
2vxv = v(vx + xv) + (xv +vx)v— v?x—xv? = 2B(v,x)v—20(v)x. 
Since vxv_ ! = vxvv ? = O(v) !vxv, this gives 


(59) vxo! = —x+Q(v) Biv, x)v. 


Thus v € T. We recall that a map of the form 


(60) x > x— 0v) "Biv, xv 


is orthogonal and is called the symmetry S,, associated with the non-isotropic v (BAL p. 363). 
Evidently S, = S, if a #0 in F. We recall also that any orthogonal transformation is a product 


of symmetries and any rotation is a product of an even number of symmetries. The formula (59) 
now reads 


(61) y(v) = -S,,. 


Now it is clear that if the v; are non-isotropic, then v, ... v, € T(V, Q) and this element is in T* 


(V, Q) ifr is even. It is clear also that [*(V, Q) contains the group F* of non-zero elements of 
F. We can now prove 


THEOREM 4.15. The even Clifford group T* coincides with the set of products v} ... Vz, 
v; non-isotropic in V. We have the exact sequence 


(62) 1+ F* 51+ (V,Q)50*(V,0) 1 
where the second map is the injection of F*. 


Proof. Letu e T*. Then the automorphism a ~ uauw ! of C(V, Q) fixes the element c = 
UjU>...U,, Where (u4, U>, ..., U,) is an orthogonal base for V. Then, as above, y(u) € O* (V, Q). 


On the other hand, let fe O* (V, Q) and write f = Sy, +++ Syo, where the v, are non-isotropic 
vectors in V. Then vj... vz, € T* and 7(v, ..-¥2,) = Sy, -> Sy,» by (61). Thus y(I" ) = O°. The 


kernel of y restricted to T* is the intersection of I~ with the center of C(V, Q). Since either 
C(V, Q) or C(V, Q)" is central simple, it is clear that this intersection is F*. This completes the 
proof of the exactness of (62). Moreover, if u e T* then y(u)e O* , so there exists an element 
Vi «++ Vo, Such that y(v, ... vz) = x(u). Then u = av, ... Vz a E€ F*, and hence u = (av)vz ... 
Vz. This proves the first statement of the theorem. O 


Let C? be the opposite algebra of C = C(V, Q). We have x? = Q(x) for x € V c C°P. Hence 
we have a unique homomorphism of C into C°P sending x ~ x. This means that we have an anti- 
homomorphism i of C into C such that x ~ x. Then i” is a homomorphism fixing every x € V, so 
i? = 1. Thus / is an involution in C, that is, an anti-automorphism satisfying i? = 1. This can be 
characterized as the involution of C fixing all of the elements of V. We call i the main 
involution of C. Evidently C” is stabilized by i, so the restriction i|C* is an involution in C”. 


Now let u e I and write u =v, ... Vz, Then 


N(u) = i(u)u = Va," UU] "** D2, 


(63) ia 
= | | Q(v)e F*. 
' 


Ifu'e T*,we have 
(64) N(uu') = i(uu'juu' = i(u'ji(u)uu' = N (u)i(u')u' = N(u)N(u’). 


Hence u ~ N(u) is a homomorphism of T* into F*. It is clear from (63) that N (T*) is the set of 
products I17’@() where the v; are non-isotropic in V and this contains F*?, the set of squares of 


elements of F*. If fis any rotation, there exists a u e T* such that y(u) = f and u is determined 
up to a factor in F*. Hence the coset Mu)F*? in the group F*/F*? is determined by f. We now 
give the following definition, which ties together these concepts. 


DEFINITION 4.7. The kernel of the homomorphism N of T*;, into F* is called the spin 
group Spin (V, Q). Its image O'(V, Q) under the vector representation y is called the reduced 
orthogonal group. If f is any rotation and y(u) = f, then N(u)F*? is called the spinorial norm 


off 


The spinorial norm map is also a homomorphism (of O* (V, Q) into F*/F**). The spinorial 
norm of a rotation can be defined directly, without the intervention of the Clifford algebra. If f 
is a given rotation, then we can write f= Sy, ... S,, and we can simply define the spinorial 
norm of f to be the coset I*Q@F**, Since y(v, ... va) =f and N(i*e) = MFQ), this is the same 


element of F*/F**, which we have called the spinorial norm. The difficulty with the direct 
definition is that it is not apparent that the spinorial norm is well defined, since there are many 
ways of writing a rotation as product of symmetries. The definition using the Clifford algebra 
shows that we get the same elements of F*/F** no matter what factorization of f as product of 
symmetries is used. 

Now the reduced orthogonal group can also be defined as the kernel of the spinorial norm 
map. For, if fe O' = O'(V, Q), then there exists a u e Spin (V, Q) such that y(u) =f. Then the 
spinorial norm of fis N(u)F*? = F**. Conversely, if the spinorial norm of the rotation f is F*?, 
then f = Syria and Ili’) = #, 6 # 0. Replacing v; by & | v} we may assume M'O) = 1, 


V2 
Then y(v; ... vz) = fand N(v, ... vz) = 1, so the spinorial norm of fis F*. 


The reduced orthogonal group contains the commutator subgroup Q of 0, since any 
commutator can be written in the form 


(So, So) (So, So So, So) So, S) 


Vr+i s 


from which it is clear that the spinorial norm is F**. Thus we have the following inclusions 
among the various subgroups of O = O(V, Q) that we have defined: 


(65) 0>0730'2Q 


and o’=0* such that O*/O’ is isomorphic to the subgroup of F*/F** of cosets of the form BF*? 
where £ has the form !1i’@(), v, non-isotropic. We can say considerably more in the case in 


which Q is of positive Witt index (BAI, p. 369). 


THEOREM 4.16. Let QO be of positive Witt index. Then the reduced orthogonal group 
O'(V, Q) coincides with the commutator subgroup Q of O(V, Q) and 0° (V.QVO"(V.Q) = F*/F*, 


Proof. Since Q is of positive Witt index, there exists a subspace U of V that is a hyperbolic 
plane, so we have V = U ® v+ and U has a base (u, v) such that O(u) = 0 = O(v) and B(u, v) = 
1. The orthogonal transformations that stabilize U and act as the identity map on u+ form a 
subgroup O, isomorphic to the orthogonal group in U. OF = 0, ^ 9°(¥.@) is the set of linear maps 


fa Such that u ~ au, v ~ a` ly, w ~w for w € U* where a € F* (BAI, pp. 365366). We have f, 
= Su — vu — ay SO that the spinorial norm of f, is Q(u — v)O(u — av) = (— 1)( — a) = a. Since a 
can be taken to be any element of F*, this proves that 9*(V.QV0'(V.0) = F*/F*, Next let y be any 
nonisotropic vector in V and let O(v) = a. Then O(u + av) =a = Q(y). Hence by Witt’s theorem 


1 


there is an orthogonal transformation g such that g(u + av) = y (BAI, p. 351). Then Sy = e+”, 
Let fe O' and write f= S, ... S, where a; = Q(v;) and hence Ii’4eF" Then we have 


orthogonal transformations g; such that Se = Ser ^1 Si< 2r, Then f= 9Susang9i *'G2SereaeGar 
Since O(V, Q)/Q is abelian, 


fE haein Bris (mod Q) 
so to prove that fe Q, it suffices to prove that h € Q. Now fe O' and since Q c O', h e O. 
Hence h c O°», so h =f, = S , Sy — ay, aS above, and a = f?. Then! =h = Sve ‘S-AR, D 


The main structure theorem on orthogonal groups, which we derived in BAI, states that if Q 
has positive Witt index and n > 3, then Q/(Q N {1, — 1}) is simple except in the cases n = 4, 
Witt index 2 and n = 3, |F| = 3. An interesting question is, when does — 1 € Q? This can 
happen only for even n. In this case we have 


PROPOSITION 4.11. Let n be even and let Q be of positive Witt index. Then — 1 € Q if 
and only if the discriminant is a square. 


Proof. Let (u, U2, ..., U,) be an orthogonal base. Then the discriminant obtained from this base 
is 2” MQ) and this is a square if and only if 1i@™) is a square. On the other hand — 1 = S,, Sy, 
. Sy 


IQ) is a square. O 


„ SO the spinorial norm of — 1 is MQ). Hence by Theorem 4.15, — 1 € Q if and only if 


EXERCISES 


1. Show that any central simple algebra of degree two (p. 222) over a field F of 
characteristic # 2 is a quaternion algebra as defined on p. 232 and that any such algebra 
is isomorphic to a Clifford algebra C(V, Q) where dim V = 2. 


In exercises 2—9, Q is a quadratic form with non-degenerate bilinear form B on an n- 
dimensional vector space V over F of characteristic £ 2. 


2. Let n be even and (uj, u2, ..., u,,) an orthogonal base for V over F so O(u;) = y; £ 0. 
Obtain an explicit formula 
(66) C| V, 0) > (i, b) Al, 63)® i ®(a,, B,) 


where v = n/2 and (a, f) denotes the algebra with base (1, i, j, k) such that i? = a1, j? = 


Bl, ij =k=—i 
3. Use exercise 2 to show that C(V, Q) ~ 1 if Q has maximal Witt index ( = v, see p. 370 of 
BAI). 


4. Let n be odd, but otherwise let the notations be as in 2. Obtain a formula like (66) 
tensored with the center for C(V, Q) and a formula like (66) for C* (V, Q). 


5. Apply exercise 4 to obtain the structure of C(V, Q) and C* (V, Q) if n is odd and Q has 
maximal Witt index. 


6. Letn = 4, F =R. Obtain the structure of C(V, Q) for the following cases in which (u4, u7, 
U3, u4) is an orthogonal base for V and the matrix diag {Q(u,), O(uz), O(u3), O(ug)} is 
respectively 


I. diag { 1,1, 1,1} 
I]. diag {1,1,1-1} 
III. diag {1,1,-1,-1} 
IV. diag {1, -1,-1, -1} 
V. diag{—1,-1,-1,-1}. 


7. Note that a ~ ‘a (the transpose of a) is an involution in M,(F). Use the Skolem-Noether 
theorem to show that any involution in M,,(/’) has the form 


J,:a~> sas”! 


where ‘s = +s. Let Sym J, be the subspace of M,(F) of J,-symmetric elements, that is, 
satisfying J,(a) = a. Show that dim J, = n(n + 1)/2 if îs = s and dim J, = n(n — 1 )/2 if fs 


== 55 


8. Let A be a finite dimensional central simple algebra over F, E a splitting field for A. 
Show that if A is viewed as contained in A; = M,(£), then any involution J in A has a 


unique extension to an involution J; of A; over E. Then Jg = J, for some s e M,(E) such 


that ‘s = + s. Call J of orthogonal or symplectic type according as s is symmetric or 
skew. Show that J is of orthogonal (symplectic) type if and only if dim Sym J = n(n + 
1)/2 (n(n — 1)/2). 


9. Determine the type of the main involution in C(V, Q) for n even and the type of the 
involution induced in C* (V, Q) by the main involution if n is odd. 


If J and K are involutions in 4 and B respectively then we write (A, J) = (B, K) if there 
exists an isomorphism 7 to A onto B such that yJ = Ky. 


10. Prove the following extension of Lemma 5: If U is a 2v-dimensional subspace of V on 
which the restriction of B is non-degenerate, then C(V, O) = C(U, QO’) ® C(U, — 6'0"” 
and (C(V, Q), i) = (C(U, Q^) ® C(U, — 6'0"), i @ i) if v is even. (Here i is the main 
involution.) 


11. Let c denote the conjugation a ~ a in € and s the standard involution a a in t. Show that 
(COH, cs) = (MAC), °) (a) = tay, and (H@H,s@s) = (M4R),1)., 


The next three exercises concern isomorphisms for (C(V, Q), D where V is a finite 
dimensional vector space over R and Q is a positive definite quadratic form on V. 


12. Show that 


(i) (C(V,Q).i) = (ROR, Inga) if [V:R] = 1. 
(ii) (C(V,Q), 1) = (M,(R), 0) if [V:R] = 2. 
(iii) (C(V,Q),1) = n (C), *) where *:a ‘aif [V:R] = 3. 
(iv) (C(V, 0), 1) = (M(H), *), *:a + ‘a, if[V: R] = 4, 
13. Show that ifn = 2v, then 

(C(V,Q), 1) = (M2.(R), t)) if v = 0,1 (mod 4) 
(C(V, 0), 1) = (M42.-.(1H), *) if v = 2,3 (mod 4) 
(* as in Exercise 12.) 


14. Show that ifn = 2v — 1 then 


(C(V,Q), 1) = (My.-1(R), t) ® (Maz- (R), t) if v = 1 
or 3 (mod 4) 
(C(V,Q), 1) = (M.-.(C), *) if v = 0 or 2 (mod 4). 


Exercises 15—18 sketch a derivation of the main structure theorem for Clifford algebras for 
arbitrary fields including characteristic two. 


15. 


16. 
17. 


18. 


Define a quaternion algebra over an arbitrary field F to be an algebra generated by two 
elements 7, 7 such that 


(67) ?=ital, 4a+140, j =pl¥0, ji=(l-i). 
Show that any such algebra is four-dimensional central simple. Show that this definition 
is equivalent to the one given in the text if char F # 2. 

Now let V be an n-dimensional vector space equipped with a quadratic form Q with 
non-degenerate bilinear form. Note that if char F = 2, then B is an alternate form and 
hence n is even. 


Show that if = 2, then C(V, Q) is a quaternion algebra. 


Show that Lemma 5 is valid for arbitrary F. (Hint: Let (u, v) be any base for U and let 
d= w-®a, Use this a in place of the one used in the proof given in the text to extend the 
proof to arbitrary F.) 


Prove Theorem 4.13 for B non-degenerate and F arbitrary. Note that the proof of dim 


C(V, Q) = 2” given in the text by reduction to the non-degenerate case carries over to 
arbitrary F. 


In exercises 19—21 we assume that char F = 2. 


19. 


20. 


Define C* = C* (V, Q) as for char F £ 2: the subalgebra generated by all products uv, u, 
v € V. Show that dim C* = 2”~! and C* is generated by the elements u,v for any non- 
isotropic u,. Show that the subalgebra C’ of C* generated by the u,v such that O(u,) # 0 
and »Łu is isomorphic to a Clifford algebra determined by an (n — 2)-dimensional 
vector space and that this algebra is central simple of dimension 2” ~ 7. Let (11, vy, ..., 
U, Vv) be a symplectic base (that is, Bso) = ô Btu,u;) = 0 = Ble.e)), Show that ¢= Diu is in the 
center of C*, that c € F1, and that ?+¢+=i@@) =°, Hence conclude that €* = AJC and 
Fic] is the center of C*. Show that F[c] is a field or a direct sum of two copies of F 
according as Li@)0e) is not or is of the form f? + £, B € F. Thus conclude that C* is 
simple or a direct sum of two isomorphic central simple algebras according as 

Li Olulo) # B+B or = PP +f, Pek, 

Let F! denote the set of elements of the form £? + $, B € F. Show that Fl?! is a subgroup 
of the additive group of F and put G = F/F”, Let (u, vp ..., Up, v) be a symplectic base 


for V and define 


68) Art = F 0(u)Q(0) +F? 


1 
in G. Show that this is independent of the choice of the symplectic base. Arf Q is called 
the Arf invariant of Q. Note that the last result in exercise 14 can be stated as the 


following: C* (V, Q) is simple if and only if Arf Ọ £ 0. 


21. Show that there exists a unique derivation D in C = C(V, Q) such that Dx =x for x e V. 
Show that C* (V, Q) is the subalgebra of D-constants ( = elements a such that Da = 0). 


Historical Note 


Clifford algebras defined by means of generators and relations were introduced by W. K. 
Clifford in a paper published in 1878 in the first volume of the American Journal of 
Mathematics. In this paper Clifford gave a tensor factorization of his algebras into quaternion 
algebras and the center. The first application of Clifford algebras to orthogonal groups was 
given by R. Lipschitz in 1884. Clifford algebras were rediscovered in the case n = 4 by the 
physicist P. A. M. Dirac, who used these in his theory of electron spin. This explains the 
terminology spin group and spinorial norm. The spin group for orthogonal groups over ® are 
simply connected covering groups for the proper orthogonal groups. As in the theory of 
functions of a complex variable, multiple-valued representations of the orthogonal group 
become single-valued for the spin group. Such representations occurred in Dirac’s theory. This 
was taken up in more or less general form by R. Brauer and H. Weyl (1935) and by E. Cartan 
(1938) and in complete generality by C. Chevalley (1954). 
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Classical Representation Theory of Finite Groups 


One of the most powerful tools for the study of finite groups is the theory of representations 
and particularly the theory of characters. These subjects as generalized to locally compact 
groups constitute a major area of modern analysis that generalizes classical Fourier analysis. 
The subject of representation theory of finite groups is almost wholly the creation of 
Frobenius. Notable improvements and simplifications of the theory are due to Schur. During 
the past fifty years, very deep results have been added to the -theory by Brauer, and 
representation theory has played an important role in the explosive growth of the structure 
theory of finite groups, which began with the Feit-Thompson proof of a hundred-year-old 
conjecture by Burnside that every finite group of odd order is solvable. 

In this chapter, we are concerned with the classical theory of representations of finite 
groups acting on finite dimensional vector spaces over the field € of complex numbers or more 
generally over fields of characteristics not dividing the group order. This restriction on the 
characteristic implies complete reducibility of the representation or, equivalently, semi- 
simplicity of the group algebras. This permits the application of the structure and 
representation theory of finite dimensional semi-simple algebras to the representation theory of 
finite groups. However, there is considerably more to the story than this, namely, the theory of 
characters, much of which can be developed without recourse to the theory of algebras. 

We shall derive the classical results of Frobenius, Schur, and Burnside as well as Brauer’s 
results on induced characters and splitting fields. One of the most important contributions of 
Brauer is his modular theory, which deals with representations over fields whose 
characteristics do divide the group order and the relation between the representations over 
such fields and representations over ©. We shall not consider this theory in our account. A 
number of applications of character theory to structural results on finite groups will be given. 


5.1 REPRESENTATIONS AND MATRIX REPRESENTATIONS OF 


GROUPS 


DEFINITION 5.1. By a representation p of a group G we shall mean a homomorphism of G 
into the group GL(V) of bijective linear transformations of a finite dimensional vector space 
V over afield F. 


We shall say that p is a representation of G over F and that it acts on V/F. The 
dimensionality of V is called the degree of the representation. The defining conditions for a 
representation are that p is a map of G into GL(V) such that 


(1) P(9192) = P(91)p (92) 


for g; e G. These have the immediate consequences that p(1) = 1,, p(g!) = p(g)! and in 
general, p(2¢’") = p(g)” for m € Z. We remark also that the conditions that p(g)e GL(V) and (1) 
holds can be replaced by the following: p(g) € End;V, (1) holds, and p(l) = 1. These 
immediately imply that p(g)e GL(V), so p is a representation. 

Let B = (uy, u,..., u,,) be a base for V/F. Ifa € End; V, we write 


n 
(2) qu; = y Xiu L<icn 
j=1 


and obtain the matrix (a) whose (i, j)-entry is a. The map a ~ (a) is an isomorphism of End; 
V onto M,,(/). Hence ifp is a representation of G acting on V, then the map 


(3) g > px(9). 


where pg(g) denotes the matrix of p(g) relative to the base B, is a homomorphism of G into the 
group GL,(F) of invertible n x n matrices with entries in F. Such a homomorphism is called a 
matrix representation of G of degree n. A change of the base B to C= (1, ..., v,,) where v; = 
Xu; u; and (u) € GL,(F) replaces pz by pe where 


(4) Pclg) = (u) ` plg) (u). 


This matrix representation is said to be similar to pp. It is clear that any matrix representation 
can be obtained from a representation in the manner indicated above. 
A homomorphism of G into the symmetric group S,, or, equivalently, an action of G on the 


finite set {1, 2, ..., n} (BAI, p. 71), gives rise to a representation. Let the action be described 
by 


(5) gi = n(g)i 


where z is the homomorphism of G into S,,. Let V be the vector space with base (uw, Uo, ..., Up) 
and let p(g) be the linear transformation of V such that 


(6) p(g)U; = Urg) 


Then p(g12>) = p(€Dp(e2) and p(l) = 1, so p is a representation of G. Representations of this 


type are called permutation representations. They are characterized by the property that they 
are representations that stabilize some base B of V/F. Of particular interest is the permutation 
representation obtained from the action of G on itself by left translations (BAI, p. 72). The 
corresponding representation of G is called the regular representation. 


EXAMPLES 
1. Let G = @, the cyclic group generated by an element g of order n. We have the 
homomorphism of G into S,, mapping g into the cycle (12...n). This action is equivalent to the 


action of G on itself by left translations. The associated permutation representation maps g into 
the linear transformation p(g) such that 


(7) p(g)u; = úi L<ign-l,  p(g)u,= u. 
This gives a matrix representation in which g is represented by the matrix 
0... 01 
10. + 0 
0 | 
(8) 
O> >10 


This is obtained by specializing (2) to (6) 
2. Let G = $3 whose elements are 


1, (12), (13), (23), (123), (132). 


The identity map is an isomorphism of G onto itself. This gives rise to a permutation 
representation and an associated matrix representation in which 


(13)>10 1 


(123) ~+{ 1 0 
0 1 


aS Q m 


0 
0 


(12) ~> 


OR) 


(132) ~ 


O = © 


G Ga = 


J 


0 
0 


Oo © 


an 


© 


1 
0 


For a given group G and given field F we now consider the class &(G, F) of 
representations of G acting on vector spaces over F as base field. There is a rich algebraic 
structure that can be defined on X(G, F). First, let p4 and p, be representations of G acting on 
the vector spaces V,/F and V,/F respectively. Form the vector space V,;® p V and define p; @ 


p2 by 


(9) 


(P:®p) (9) = pi9)®p9) 


where, as usual, a, © a> for the linear transformations a; of V; is the linear transformation of V; 


® V, such that 


(10) (a, Qa) (x; @Xz) = a,X,@azxy. 


If b;, i = 1, 2, is a second linear transformation of V;, then we have (a, ® a,)(b, @ by) = a,b, @ 
aby. It follows that if g4, g) € G, then 


(01 @P2) (9192) = P1(9192)@P2(G192) 
= 01(91)P1(92)@p2(91)p2(92) 
= (0:(91)@p2(91)) (01(92)®p2(92)) 
= ((0; ®p2) (91) (01 @p2) (G2). 


Hence p; ® p> is a representation. We call this the tensor product of the given representations 


pı and po. 
If (uy, Uo,..., U,) is a base for V/F and (vi, V9,..., Vm) is a base for V>/F, then the mn 
vectors u; ® v; constitute a base for (V; ® V2)/F. We order these lexicographically: 


(11) (Uy OVis.. -s Uy Om Uz @Vq, «+ UOU m + + +5 Uy QV yy) 


Then if a, € End, V; has the matrix (a) relative to the base (u, ..., u,,), and a) € End; V> has 


the matrix (a) relative to (vj, ..., Vp), we have 
= (1) DEIN: Gems (2),, 
au =} au, avy, = > aly, 
j l 


and 


(a; Qa) (uj@v,) = ¥ Pallu Ov). 
jl 


Hence the matrix of a, ® a, relative to the base (11) is 
1)/ (2) W)(yl2)) we. yyl?) 
aira) aja) Xin (a?) 


1)/ (2 
(12) at) (a'?)) of) (a'?)) cine ah (a ’) 


1 2 1 2 
(a) aiz (al r) m ain (a! ') 


We denote this matrix as ((a™) ® (a®)). In particular, we see that if p;g (g) is the matrix of 
p,(g) relative to B, = (u, ..., u,) and P2p,(8) is the matrix of p,(g) relative to By = (vp ..., Vm) 
then the matrix of (p; ® p2) (g) relative to the base (11) is P1B(8) © P282). 

As usual, we denote the dual space of linear functions on V/F by V*. If (w, uo, ..., u,) iS a 


base for V/F, we have the dual (or complementary) base (uy, uz, ..., Un ) of V'/F where i is 


the linear function on V such that u; (u;) =ðj, 1 <j <n. Ifa is a linear transformation in V/F, 
we have the transposed transformation a* in V* such that 


(13) a*x*(y) = x*(ay) 


for y e V and x* e V* If au = Ya;u; and a*u*, = XB,u*), then 
aut (ui) = EB uur (u) = Ba and ut lau) = wean) = % Hence B, = az; and so the matrix of a* relative to 
the dual base (u,*, ..., u,,*) of (up ..., u,,) is the transpose of the matrix of a relative to (up ..., 


u,). The map a ~ a* is an anti-isomorphism of the algebra End; V onto End; V* and hence a ~ 
(a*) 7! is an isomorphism of the group GL(V) of bijective linear transformations in V into 
GL(V*). 

Now let p be a representation of G acting on V. We compose this with the isomorphism a ~ 


(a*)! of GL(V) onto GL(V*). This gives a representation g ~ (p(g)*)! of G acting on V*. We 
call this the contragredient representation of p and denote it as p*. Evidently it has the same 
degree as p. Moreover, if B = (u, ..., u,) and B* is the dual base for V*, then we have the 


following relation for the matrix representations g ~ pp(g) and g ~ p*p+(g): p*gx(g) = (‘pp(g)) 
1 

The map p ~ p* may be regarded as a unary composition in X(G, F). We also have an 
important nullary composition. This is the unit representation | or 1g for which V is one- 
dimensional and 1(g) = 1, for all g. The corresponding matrix representation (determined by 
any base) is g ~ (1) where (1) is the 1 x 1 matrix with entry 1. 

Let p; i = 1, 2, be a representation of G acting on V/F. Then we say that p, and p, are 
equivalent if there exists a bijective linear map 7 of V, onto V, such that 


(14) p2(9)=np,(gn', geG. 


EXERCISES 


1. Let p be the representation of the cyclic group “@ given in example 1. Let € be the base 
field. Show that p is equivalent to the representation p’ such that p'(g)u; = Cu; where ¢ = 


erin., 
2. Let p;, i = 1, 2, be a representation of G acting on V/F and put V = hom,{V,, V). fle V, 


define p(g)/ = p,(g)lp,(g) |. Verify that p is a representation and show that p is 
equivalent to p;* @ po. 


3. Let Z denote the identity map of GL(V). This is a representation of GL(V) acting on V. 
Consider the representation /* ® / acting on V*® V. Show that the set of vectors c e V* 
® V such that (/* @ /)(a)c = c for all a e GL(V) is a one-dimensional subspace of V* ® 


V. Find a non-zero vector in this space. 


4. Show that if p, i = 1, 2, is a representation acting on V; then p; ® p, and p, ® p} are 
equivalent. 


5.2 COMPLETE REDUCIBILITY 


The study of the representations of a group G can be reduced to the study of the representations 
of the group algebra of G. Let G be a group, F a field, then the group algebra F[G] is the 
algebra over F having the set G as base and multiplication defined by 


(15) » ua) (3 Bah | => a, B,gh 


geG heG g,h 


for a, Pn € F. Let p be a representation of G acting on the vector space V/F. Then the group 
homomorphism p has a unique extension to the algebra homomorphism 


(16) 20g > 2.0, plg) 


of F[G] into End-V. Conversely, given a homomorphism p of F[G] into End; V, where V is a 


finite dimensional vector space over F, the restriction of p to G is a representation since p(1) = 
1, which implies that every p(g) e GL(V). Now we recall that a representation of F[G] ( = 
homomorphism of /[G] into End-V) can be used to make V into an F[G]-module. One simply 


defines the action of F[G] on V by 


(17) (Sa,g)x = da, p(g)x 


for x € V. Again, this can be turned around: Given a module V for F[G], this is a module for F, 
hence, a vector space over F and, assuming finite dimensionality of V/F, we obtain a 
representation p of G where p(g) is the map x ~ gx, which is a linear transformation of V/F. 
Thus representations of G acting on (finite dimensional) vector spaces over a field F are 
equivalent to F[G]-modules, which as F-modules are finite dimensional. 

The standard concepts of module theory can be carried over to representations of groups 
via the group algebra. If p is a representation of G acting on V, a submodule U of V as F[G] - 
module is the same thing as a subspace of V that is p(G)-invariant in the sense that it is 
stabilized by every p(g), g € G. Then we obtain a representation p|U of G acting on U in which 
(plU) (g) is the restriction of p(g) to U. We shall call this a subrepresentation of p. We also 
have the module V/U. The associated representation of G is p|V/U where (p|V/U) (g) is the 
linear transformation x + U ~ p(g)x + U. This will be called a factor representation of p. 

Let B = (u, ..., u,) be a base for V such that (u, ..., u,) is a base for the p(G)-invariant 


subspace U. Consider the matrix representation pp determined by B. Since U is stabilized by 
every p(g), p(g)u; for 1 < i <r is a linear combination of the vectors (u, ..., u,.). Hence every 
matrix pp(g), g € G, has the “reduced” form 


Biro By 


Birt _ Bin 


Bri E Prr Brr+i _ Brn 


(18) 
Pitia 


| ee 


Persi = Bun 


The matrices in the upper left-hand corner are those of the matrix representation of G acting on 
U associated with the base (uj, ..., u,) and those in the lower right-hand corner are the 
matrices of the representation of G on V/U relative to the base (wu, , + U, ...,u, + U). It is 
clear that conversely if there exists a base (uw, ..., u,) such that the matrices of the p(g) 
relative to this base all have the form (18), then U = 2,"Fu; is a p(G)-invariant subspace, 
hence, an F[ G]-submodule of V. 

Ifv = U® U' where U and U' are submodules, then we can choose a base B = (up ..., Up) 
for V such that (uw, ..., u,) is a base for U and (u, +} ,..., u,) is a base for U’. Then the 
corresponding matrices p,(g) have the form (18) in which the r x (n — r) blocks in the upper 
right-hand corner are all 0. If p} = p|U and p, = p|U’ then we say that the representation p is a 
direct sum of the subrepresentations p, and p and we write p = p} ® p>. Let p be the 
projection on U determined by the decomposition V = U ® U'. Then if we write any x € Vas x 


=ytyye U,y'e U', we have p(g)x = p(g)y + p(g)y’ with p(g)y € U, p(g)y'e U' for all g 
e G. Since p is the map x ~ y, we have pp(g)x = p(g)y = p(g)px. Thus p commutes with every 
p(g). Conversely, suppose U is any p(G)-invariant subspace and there exists a projection p of 
V on U that commutes with every p(g). We can write V = pV @(l — p)V and pV = U. Also, p(g) 
(1 —p)V=(1 — p)p(g)V = C1 — p)V. Hence U' = (1 — p)V is p(G)-invariant and we have the 
decomposition V = U ® U’. 

We shall call a representation p of G irreducible (completely reducible) if the 
corresponding F[G]-module is irreducible (completely reducible). We recall that a module V 
is completely reducible if and only if it satisfies either one of the following conditions: (1) V = 
XV; where the V; are irreducible submodules, (2) V £ 0 and for every submodule U there exists 


a submodule U’ such that V = U ® U' (Theorem 3.10, p. 121). We shall use these conditions to 
prove two theorems giving sufficient conditions for complete reducibility. The first of these is 


a fundamental theorem in the representation theory of finite groups. This is 


MASCHKE’S THEOREM. Every representation p of a finite group G acting on a vector 
space V/F such that the characteristic char Ft\G\is completely reducible. 


Proof. Let U be a p(G)-invariant subspace of V and write V = U ® Up where Ug is a second 
subspace (not necessarily invariant). Let pọ be the projection on U determined by this 


decomposition. We shall now construct by an averaging process a projection on U that 
commutes with every p(g), g € G. We put 


(19) p=— } p(g)'pop(g): 


geG 


i 


Since char FtlGl, |G]! exists in F and p is well defined. If g'e G, then 


plg’) ‘pp(g’) “a Y alg’) Pg) ‘pop (g)o(g') 


© p(gg')” ‘pop(gq’) 


geG 


“i 


“ig? ~F pl 9)" pop(g) 


Hence p(g’)p = pp(g') for g' € G. Evidently, p e End;-V. Ify € U, then poy = y and since p(g)y 
€ U, poo(g)y = p(g)y. Hence p(g) ' pop(g)y =y and 


(20) ry = ig Lolo 'poplg)y = zly =»: 


Ifx € V, then pọx € U and p(g)! poo(g)x € U. Hence px € U. The two conditions on p € Endp 


V, py =y for y e U and px € U for x € V, imply that p is a projection on U. Then V = U ® U' 
where U' = (l — p)V and since p commutes with every p(g), U' is p(G)-invariant. O 


This result has a formulation in terms of matrix representations that should be obvious from 
the discussion above. The result in its matrix form was proved by H. Maschke for the case in 
which F = €. The validity of the result for any F with char FtiGl was first noted by L. E. 
Dickson. 


If p is a completely reducible representation of G acting on V, then the F[G]-module V 
decomposes as V= 11° O "SVP ONNO OWS OV" OVP" where the VE are irreducible 
and VP = V” for any k, / but Y} # V} if i473. By Theorem 3.12 (p. 123), the submodules Wm dah 
are the homogeneous components of V. If p; is any irreducible representation of G equivalent to 


the subrepresentation determined by the V), then we write 


(21) p ~mp Amp: OM, pr 


We call p; an irreducible constituent of p and m; its multiplicity. By Theorem 3.14, the 
equivalence classes of the irreducible constituents and their multiplicities are uniquely 
determined. We remark also that the multiplicity m; = q;/n; where q; = dim W, n; = dim V, 
which shows also that m; is independent of the particular decomposition of V as a direct sum of 
irreducible submodules. 


If H is a subgroup of G and p is a representation of G acting on V, then the restriction of p to 
H is a representation of H that we shall denote as py. We shall now show that if H=G and p is 


completely reducible, then pp is completely reducible. It suffices to prove this for p 


irreducible. Moreover, in this case we can say considerably more on the relation between the 
irreducible constituents and their multiplicities. First, we need to give a definition of 
conjugacy of representations of a normal subgroup of a group. 

Let H=G and let o be a representation of H acting on U. For any g € G we can define the 
map 


(22) h~ o(ghg~'). 


This is the composite of the automorphism h ~ ghg™! of H with the homomorphism ø. Hence 
(22) is a representation of H acting on U, which we shall denote as £o. Any representation 
equivalent to £g will be called a conjugate of o or, more precisely, a g-conjugate of o. 
Evidently, any o(H)- invariant subspace is £o(H)-invariant and since o = g (£0), it is clear 


that o and £c have the same lattices of F[H]-submodules. In particular, if o is irreducible, then 
any conjugate of ø is irreducible. Now suppose o, and o, are equivalent representations of H 


acting on U, and U, respectively. Then we have a bijective linear map 7 of U, onto U, such 
that o,(h) = yo(h)y', h e H. Then also, o2lhg™')=no,(ghg"'w"', heH, so *a, and %e, are 
equivalent. 

We can now prove 


(A. H.) CLIFFORD’S THEOREM. Let H=G and let p be an irreducible representation of 
G. Then py is completely reducible and all of its irreducible constituents are conjugate and 


have the same multiplicity. 


Proof. Let p act on V and let U be an irreducible F|H]-submodule of V. Evidently, Ło e 
cp(g)U is a p(G)-invariant subspace of V containing U. Hence V = Xp(g)U. Leth e H,y € U. 
Then 


(23) p(h)p(g)y = plhg)y = plgg”'hg)y = p(g)p(g- ‘hgy € p(g)U. 


Thus p(g)U is p(#)-invariant. Let o denote the representation of H acting on U and o' the 
representation of H acting on p(g)U. Now by (23), «they =e? ethy.. Since y ~ p(g)y is a 
bijective linear map of U onto p(g)U, we see that o’ is a conjugate of o. Hence p(g)U is an 
irreducible F[H]- module and V = Lp(g)U is a decomposition of V as sum of irreducible F[H]- 
modules such that the corresponding representations are conjugates. Then V is a direct sum of 
certain ones of the p(g)U. Hence py is completely reducible and its irreducible constituents 


are conjugate. If U, and U, are isomorphic irreducible F[H]-submodules, then p(g)U, and 
p(g)U> are isomorphic F[H] submodules. It follows that every p(g) permutes the homogeneous 


components of V as F[H]-module. Thus we have an action of G through p(G) on the set S of 
homogeneous components of V as F[H]-module. Moreover, this action is transitive since if U 
is any irreducible F[H]-submodule, then any other irreducible F[H]-submodule is isomorphic 
to a p(g)U for some g € G. This implies that all of the irreducible constituents have the same 
multiplicity. o 


EXERCISES 


— 


. Prove the following extension of Maschke’s theorem. Let p be a representation of a group 
G that is not necessarily finite. Suppose G contains a normal subgroup H of finite index 
[G : H] not divisible by char F. Show that if p|H is completely reducible, then p is 
completely reducible. 


2. Let F = € and let H(x, y) be a hermitian form on V/C that is positive definite in the sense 
that H(x, x) > 0 for all x 4 0 (BAI, pp. 381-384). Let p be a representation of a group G 
acting on V such that every p(g) is unitary with respect to H, (H(p(g)x, p(g)v) = H(x, y), 
x,y € V). Such a representation is called unitary. Show that if U is a p(G)-invariant 
subspace, then v+ is p(G)-invariant. Use this to prove that p is completely reducible. 


3. Same notations as in exercise 2. Let G be finite, p a representation of G acting on V. Let 
H)(x, y) be any positive definite hermitian form on V/C. Put 40%!) = LeoMole@ se) Show 
that H(x, y) is positive definite hermitian and that every p(g) is unitary relative to H. Use 
this and exercise 2 to prove Maschke’s theorem for F = €. 


4. Define the projective linear group PGL(V) of a finite dimensional vector space V/F as 
GL(V)/F*1 where F* is the multiplicative group of non-zero elements of F. A 
homomorphism p of a group G into PGL(V) is called a projective representation of G. 
For each g € G let u(g) denote a representative in GL(V) of the coset of p(g) € PGL(V). 


Then “u(g12>) = Ye pg LENE) where ben" F*. Define p(G)-invariant subspaces and 


complete reducibility as for ordinary representations. Prove the analogue of Maschke’s 
theorem for projective representations. 


5. Show that p is irreducible (completely reducible) if and only if p* is irreducible 
(completely reducible). 


6. Let y be a representation of G acting on W, V a y(G)-invariant subspace. Put p = yfV. 
Show that p* ® y has a subrepresentation equivalent to the unit representation. 


7. Let y and p be representations of G acting on W and V respectively. Assume p is 
irreducible and p* ® wy has a subrepresentation equivalent to the unit representation. 
Show that y has a subrepresentation equivalent to p. 


8. Show that the following are irreducible groups of linear transformations of the finite 
dimensional vector space V/F: (1) GL(V), (2) O(V, Q) the group of orthogonal linear 
transformations relative to a non-degenerate quadratic form Q, (3) O* (V, Q) the rotation 
subgroup of O(V, Q). (See Chapter 6 of BAL.) 


5.3 APPLICATION OF THE REPRESENTATION THEORY OF 
ALGEBRAS 


There are two main methods for developing the representation theory of finite groups: the 
structure and representation theory of finite dimensional algebras applied to group algebras 
and the theory of characters. Many results can be obtained by both methods, and the choice of 
the method is often a matter of taste. The first method is introduced in this section and the 
theory of characters appears in section 5. Our primary concern will be with representations of 
finite groups over fields whose characteristics do not divide the order of the group. However, 
the first two theorems that are given do not require these restrictions. 

We begin by defining certain algebras of linear transformations associated with a 
representation p of a group G acting on a vector space V/F. First, we have the enveloping 


algebra Env p(G), which is the set of linear transformations of the form >="? %*" | If we 
extend p to a homomorphism p of A = F[G] as in (16), then Env p(G) = p(A). Next we have the 
algebra A'’= End, V. Evidently, this is the set of linear transformations of V that commute with 
every a € p(A), so End,V is the centralizer in End;V of p(A). If p is irreducible, then by 
Schur’s lemma, A’ is a division algebra. Next we have A’= End,, V, which is the algebra of 
linear transformations that commute with every a' e A’. Evidently, A” > Env p(G) = p(A). We 
remark that if we define 4" = End ,V, then it is trivial to see that A’ = A'. Hence the process 
of creating algebras by taking endomorphism algebras breaks off with A”. We call this the 
double centralizer of p(A) and we say that p has the double centralizer property if A" = p(A). 
We have the following general result on completely reducible representations. 


THEOREM 5.1. Any completely reducible representation p of a group G has the double 
centralizer property. 


Proof. It is easily seen that this is a special case of Theorem 4.10 (p. 222). However, it is 
somewhat simpler to base the proof directly on the density theorem: Let (u4, uz, ... ,u,) be a 


base for V/F and let / be an element of the double centralizer of p(A). By the density theorem, 
there exists ana € p(A) such that au; = lu, 1 < i < n. Hence l =a € p(A) and A" =p(A). o 


We state next 


THEOREM 5.2. 4 finite group has only a finite number of inequivalent irreducible 
representations. 


This follows by applying Theorem 4.5 (p. 208) to the artinian ring A = F[G]. We omit the 
details, since this result will not play any role in the sequel. We shall now prove 


THEOREM 5.3. The group algebra A = F[G] of a finite group G over a field F is semi- 
simple if and only if char FtiG. 


Proof. Suppose first that char FłtlGl. Then every representation of G is completely reducible. 
Hence A regarded as (left) A-module is completely reducible. Then A is semi-simple by the 


structure theorem for semi-primitive artinian rings (p. 203). Now suppose char F] |G|. Consider 
the element 


(24) z=} g 


of A. This is non-zero and we evidently have 
(25) gz=2=zg 


for all g'e G. Hence F7 is an ideal in A. From (25) we obtain * = Es-s9'z = Gl: =, since char F| 
|G|. Hence Fz is a non-zero nilpotent ideal in A and A is not semi-simple. O 


For the remainder of this section, we assume that G is finite and that char FtlGl. We have 
where the A; are the simple components. By the matrix form of Wedderburn’s theorem, 
(27) A, = M,,(Aj) 


where A; is a division algebra over F. If J; is a minimal left ideal of 4, it is a minimal left 


ideal of A and hence gives an irreducible representation of G. Moreover, /; * J; if i #7 and 
every irreducible representation of G is equivalent to one obtained from one of the Z; (Theorem 


4.4, p. 208). Hence there are s equivalence classes of irreducible representations of G. We 
determine next the degree of the irreducible representations provided by Jj. Let {e,|1 < i, j <n} 


be the usual set of matrix units in M,(A), A a division algebra. Then 


(28) M,,(A) = M,(A)e,, °° OM, (A)enn 


and M_,(A)e;; is a minimal left ideal and every minimal left ideal of (A) is isomorphic as 
M,(A)-left module to M7, (A)e,;. We have the subalgebra Al of matrices d1 = diag{d, d, ..., d} 
and we can regard M,(A) as (left) vector space over A by defining d(a) for the matrix (a) to be 
(d1)(a). Then M,,(A) has the base {e,} over A, so its dimensionality over A is n>. Similarly, 
M,(Aje;, = Ae; ®...®Ae,;, so the dimensionality of M,(A)e; over A is n. Then the 
dimensionality of M,(A)e,; over F is 


(29) [M,,(A)e;;:A][A:F] = nd 


where d = [A: F]. It now follows that [/; : F] = n,d; where d; = [A; : F] and this is the degree 
of the irreducible representation provided by Z; We can summarize these results as 


ni? 


THEOREM 5.4. Let A= F[G] be the group algebra of a finite group G over a field F such 
that char Ft\Gl and let A = A, ®...®A, where the A; are the simple components of the semi- 


simple algebra A. Assume that A; = M,,(Aj) where A; is a division algebra and let [A; : F] = 
d, If I, is a minimal left ideal of A, then I, provides an irreducible representation of G and 
the irreducible representations obtained from the I, for 1 < i< s form a set of representatives 


of the equivalence classes of irreducible representations of G. Moreover, the degree of the 
irreducible representation provided by I; is n,d;. 


Let {pl < i < s} be a set of representatives of the equivalence classes of irreducible 
representations of G. As above, we may take p; to be the irreducible representation provided 
by a minimal left ideal J; of A; At any rate we may assume that p; is equivalent to the 
irreducible representation determined by /;. If p is any representation of G acting on V, then V 
= V, ®...®V,, where the V, are p(G)-invariant and p|V; is irreducible. Then p|V; = p; for some i 
and the number m; of j such that p|V; = p; for a fixed 7 is independent of the decomposition of V 


as V, ®...®V,. As on p. 254, we can write p ~ mp; ® m»p,® ... ®”sp, and call m; the 


S 
multiplicity of p; in the representation p. 


Now consider A as (left) A-module. Let p denote the representation of G determined by this 
module. The base G of A is stabilized by every p(g): x ~ gx. It is clear that p is the regular 
representation of G (p. 248). The decompositions (26) and (28) show that the multiplicity of 


the irreducible representation p; determined by J; in the regular representation is n; We 
therefore have the following result on the regular representation of G. 


THEOREM 5.5. Let the notation be as in Theorem 5.4. Then the multiplicity of p; in the 
regular representation is nj. 


The number of equivalence classes of irreducible representations of G is the number s of 
simple components A; of A= F[G]. Let cent A denote the center of A. Then 


(30) cent A = cent A, ®' Acent A, 


where the center, cent 4;, is isomorphic to the center of M,,(A;) and hence to the center of Aj. 
Hence cent A; is a field and is a simple component of the semisimple commutative algebra cent 


A. Then s is the number of simple components of cent A. We now determine a base for this 
algebra. 


PROPOSITION 5.1. Let G=C,(= {1}) UG, U... UC, be the decomposition of G into 
conjugacy classes (BAI, P. 74). Put 


(31) (; = 3 Ji. 


gC, 


Se ee ae ky ee A 
Proof. Ifg e G, then? “%~ *#4 99545“ since the map x ~ g xg permutes the elements 


of the conjugacy class C;. Hence c; commutes with every g € G and with every =*9¢/L], Thus 
c; € centF[G]. Now let®=>%#' e cent F[G]. Ifh € G, 


h~*ch=Y yh "gh =). Yng 


g g 


so the condition A~!ch = c gives Yhgh! = Yg A € G. Thus any two elements g and g’ in the same 


conjugacy class have the same coefficient in the expression €= ¥79' and hence c is a linear 
combination of the c;. It is clear that the c; are linearly independent and so (c), C2, ... , c,) is a 


base for cent F[G]. O 


Both the foregoing result and the fact that the number s of equivalence classes of irreducible 
representations of G is the number of simple components show that s < r, the number of 


conjugacy classes. More precisely, we have r = X,° [cent A; : F]. This shows that s = r if and 
only if cent A; = F for all i, that is, the A; are central simple. Since cent A; is a finite 
dimensional field extension of F, we have cent A; = F for all i if F is algebraically closed. 


Hence in this important case the number of equivalence classes of irreducible representations 
is the number of conjugacy classes. In the algebraically closed case we have the following 


important result. 


THEOREM 5.6. Let G be a finite group, F an algebraically closed field such that char 
F{\G\, Let s be the number of conjugacy classes of G. Then the number of equivalence classes 
of irreducible representations over F (acting on vector spaces V/F) is s and if p4, ..., p; are 


representatives of these classes and n; is the degree of p;, then 


2) |G| = y ni”. 
' 


Proof. The first statement has been proved. To see the second, we use (26) and (27) and the 
fact that since F is algebraically closed, the only finite dimensional division algebra over F is 
F itself. Then A = F[G] = 4, ® ... ® and 4; = M,(F). Then [4:F] = |G| = Sim’ and Theorem 5.4 
shows that n; is the degree of the irreducible representation p; associated with 4; O 


EXAMPLES 


1. Let G = °, the cyclic group of order n generated by z and let F = €. Then A = C[G] is a 
direct sum of n copies of €. Hence we have n inequivalent irreducible representations, all of 
degree 1. It is clear that in the corresponding matrix representations we have z ~ (e^) 
where ( ) is a1 x 1 matrix andr = 1, 2, ..., n. 


2. Let G = D,, the dihedral group of order 2n generated by two elements r, s such that 7” = 


1,s?=1, srs! = r`! (BAL pp. 34, 70). The elements of D, are 7%, r*s, 0 < k < n — 1, and we 
have the relation s7* = r*s. Hence (7s)? = 1, so the n elements “s are of period two. Using 
the multiplication table: ""=""" PPs srs Wik =s sess it is readily seen that ifn is odd 
=2v+l1,v>1, then there are v + 2 conjugacy classes with representatives: 1,7“,1<k<v,s, 
and ifn = 2v, v > 1, then there are v + 3 conjugacy classes with representatives 1, 7%, 1 < k < 
v, $, rs. On the other hand, we can list these numbers of inequivalent irreducible matrix 
representations over C as follows: 


n=2vt+l,v>1. 

pı the unit representation. 

Ph, the matrix representation of degree 1 such that r ~ (1), s ~ (-1). 
O} 1 < l< v, the matrix representation of degree 2 such that 


H 0 ` (" ) 
ro , se 
0 i 1 0 


\ 


where œ = e?’ 


n= 2.02 1. 

p, the unit representation. 

Ph, the matrix representation of degree 1 such that r ~ (1), s ~ (— 1). 
p3, the matrix representation of degree 1 such that r ~ (—1), s ~ (1). 
Pa, the matrix representation of degree 1 such that r ~ (—1), s ~ (— 1). 
The representations o7, 1 </<v-—1, as above. 


It is easy to verify that the representations listed are irreducible and inequivalent. Hence 
they constitute a set of representatives of the equivalence classes of irreducible representations 
of D,,. As a check, we can verify the degree relation (32) in the two cases: 


n= 2v+l, 1+1+4(v) = 2(2v+1) = 2n. 
n=2v, 4(1)+4(v—1) = 4v = 2n. 


We shall consider next a process of extension of the base field of a representation. We need 
to recall some simple facts about extension of the base field of a vector space. For our 
purpose, it suffices to restrict our attention to finite dimensional vector spaces. Thus let V/F be 
an n-dimensional vector space over the field F and let K be an extension field of F. We can 
form the tensor product Vg = K ®p V, which can be regarded as a vector space over K. The 


injective map v ~ 1 ® „ permits us to regard V as contained in Vz as an F-subspace such that 
KV = Vx. Moreover, F-independent elements of V are K-independent and any base for V/F is a 
base for V,/K (see p. 220). A linear transformation / of V/F has a unique extension to a linear 
transformation of Vg/K. We denote the extension by / also. These extensions span Endy Vx as 
vector space over K, that is, Endy Vg = K End; V. Moreover, Endy Vp = K @ p Ende. 

Now suppose we have a representation p of G acting on V/F. Then the defining relations 
P(8182) = p(gi)P(g2), pA) = 1p give the relations p(gigy) = p(g1)p(gz), p(l) = 1p, for the 
extensions. Hence px: g ~ p(g) (in Vx) is a representation of G acting on Vg. We call this the 


representation obtained from p by extending the base field to K. It is clear that if A = F[G], 
then K[G] = Ag and P«(KLG]) = KoA) = plA)x-, We also have the following useful result. 


PROPOSITION 5.2. Eda Vk = K Ending V S (Endanea Vx, 


Proof. The elements of EndpgV (EndxygVx) are the linear transformations of V (Vx) that 
commute with every p(g), g € G Hence it is clear that K Endprg, V c Endxtg, Vx. To prove 
the reverse containment, we choose a base (4;) for K/F. Then element of Endy Vx can be 
written in one and only one way in the form X/,/; where /; e End;V. The conditions that os 
commutes with every p(g) imply that p l; commute with every p(g), hence, that /; € Endrjg] 

Thus 2A,J; € Endgjg] Vx and Endgjgy Vg = K Endri”. We have noted that Endr Vk K@End, y i 


This implies that if 7 is any a a End V/F, iei KE S KOE, In particular, we have 


K Ende, V = K@p Endy, ¥ = (Endro), o 


It is clear that if U is a p(G)-invariant subspace of V, then KU = Ux is a p(G)-invariant 
subspace of Vg. Hence if px is irreducible, then p is irreducible. The converse need not hold, 
as the following examples show. 


EXAMPLES 


1. Let G = @, the cyclic group of order 4 and let F = R. We have the representation of 
degree two over R in which p(g) is the linear transformation with matrix 


| 0 1 ) 
=I 0, 
relative to the base (u, v) of V/®. Since the characteristic polynomial of this matrix is 4? + 1, 


p(g) acts irreducibly on V. Now consider pe. We have the base (z = u + iv, w = u — iv) for Vg/ 
€ and 


Ag)ez = pl(g)ut+ip(gv = —v + iu = iz 
plgìcw = p(g)u—ip(g\v = —~v— iu = — iw. 


Hence €z and € are pe(G)-invariant subspaces. The irreducible representations pe|C7 and pe 
|€ „are inequivalent. 


2. Let G be the quaternion group {+ 1, +i, +j, +k}, which is a subgroup of the multiplicative 
group of Hamilton’s quaternion algebra t over R. Let p be the representation of G such that 
p(g) is the left multiplication x ~ gx, x € M, g € G. p is irreducible since M is a division 
algebra. On the other hand, tç = M,(C) since € is a splitting field for m (p. 228). Since M, (€) 
is a direct sum of two minimal left ideals, it follows that Me is a direct sum of two pẹ(G)- 
invariant subspaces /, and /. The representations pe|/; and pel/, are irreducible and 
equivalent. 

A representation p is called absolutely irreducible if p, is irreducible for every extension 
field K of the base field F. We have the following criterion for irreducibility and absolute 
irreducibility. 


THEOREM 5.7. Let G be a finite group, p a representation of G acting on V/F where char 
F{\G|, A = F[G]. Then p is irreducible if and only if A' = End , V is a division algebra and p is 
absolutely irreducible if and only if A' = F1. 

Proof. Ifp is irreducible, then A’ is a division algebra by Schur’s lemma and ifp is absolutely 


irreducible, then 4** E°4ka'k is a division algebra for every extension field K of F. This 
implies that A’ = F1. For, suppose that A' # F1 and let c e A’, € F1. Then the minimum 


polynomial f(A) of c over F has degree > 1 and this is irreducible since A’ is a division 
algebra. Put K = F[A]/(f(A)) and consider A’y. The minimum polynomial of c € A'g is f(A) and 
this is reducible in K[A]. Hence A’y is not a division algebra, contrary to what we had proved. 
Hence A' = F1. 

Next assume p is reducible, so we have a p(G)-invariant subspace U # V, 0. By Maschke’s 
theorem, there exists a projection p on U that commutes with every p(g). Then p is an 
idempotent + 0, 1 in A’ and 4’ is not a division algebra. Thus if A’ is a division algebra, then p 
is irreducible. Since A’ = F1 implies A'y = K1, it follows that if A’ = F1, then p is absolutely 


irreducible. O 


A field F is called a splitting field for the group G if every irreducible representation of G 
over F is absolutely irreducible. We have the following 


THEOREM 5.8. Let G be a finite group, F a field with char Ft\G\. Then F is a splitting field 
for G if and only if F{G] is a direct sum of matrix algebras M,(F). 


Proof. Let A= F[G] = A,®...®A, where the A; are the simple components of A and let J; be a 
minimal left ideal of A; The representation p; of G acting on J; is irreducible and every 
irreducible representation of G over F is equivalent to one of the p;. Hence F is a splitting 
field for G if and only if every p; is absolutely irreducible. By Theorem 5.7, this is the case if 
and only if End ,/; = F1 for 1 <i < s. Now End,J/, = A’; is a division algebra and by Theorem 
5.1, PA) = Endl, = M.(4) where 4, = 4?" and n; is the dimensionality of J; as vector space over A',. 
On the other hand, since the 4;,  # i, annihilate J; and A; is simple, °“)* 4". Thus “= Ma8.. 
Now suppose F is a splitting field. Then A'; = F and 4:= M,(F)., Conversely, suppose ^A = M«(*) 
for some n';. Then “«(F) > Ma (A). By the isomorphism theorem for simple artinian algebras, this 
implies that n'; = n; and & = F. Then A; = F1, End, J; = A'; = F1, and F is a splitting field. O 


EXERCISES 


1. Let G be finite, p an irreducible representation of G over F where char FtG. Show that 
L «Pla = Gil if p is the unit representation and >««)=° otherwise. 


2. Let G be a finite group, F = Q or Z/(p) where PtiG.. Show that there exists a finite 
dimensional extension field of F that is a splitting field for G. 
3. Let p; and p, be representations of G over an infinite field F and let K be an extension 


field. Show that if p;x and px are equivalent, then p, and p, are equivalent. Does this 
hold for F finite? 


5.4 IRREDUCIBLE REPRESENTATIONS OF S, 


It is generally a difficult problem to determine the structure of the group algebra F[G] for a 
given finite group G. As we saw in the previous section, if char FtlGl, this amounts to 
determining a set of representatives for the equivalence classes of irreducible representations 
of G over F, or, as we shall now say more briefly, determining the irreducible representations 
of G over F. In this section, we give one important example for which this can be done: G = 
Sa We shall determine a set of idempotents generating minimal left ideals of F[S,,] (char Fin) 


that give the irreducible representations of S, over F. These results are due to Frobenius and to 
A. Young; our exposition follows one due to J. von Neumann as presented in van der 
Waerden’s Algebra vol. 2, p. 246. 

We recall that the number of conjugacy classes of S, is p(n) the number of (unordered) 
partitions of n (BAI, p. 75). If {r}, 72, ..., 7,} is such a partition (that is, r; > 1 and Eir: = n), 
then the permutations that are products of disjoint cycles of lengths r4, 75, ..., rp form a 
conjugacy class and every conjugacy class is obtained in this way. For the partition a = {r}, 7, 
.. Fp} We assume "772°" and we use this order to order the set of partitions 
lexicographically. Thus if J = {5), S2, ..., Sp} with 1 7%22°"?% then a > p ifr; > sat the first 
place where r, 4 s; Each partition a = {r}, r2, ..., rp} defines a (Young) tableau 


[a tl «@ ti @ « D r, boxes 


(3) G GOU ss a {H r, boxes 


W U we H r, boxes 
and each tableau defines a set of diagrams D, E,... obtained by distributing the numbers 1, 2, 


..., n in the boxes so that no two numbers appear in the same box. For example, if a = {3, 2, 
2}, then one ofthe D is 


0 B o 
(34) 2) (6 
4 (5. 


IfD (=D ) is a diagram, we define the group R(D) of row permutations of D to be the 
subgroup of S,, of permutations stabilizing the subsets filled in the rows of D. Thus, for D as in 


(34), R(D) is the set of products of cycles all of whose numbers appear in one of the rows {1, 
3,7}, {2, 6), or {4, 5} (so RW) = {1, (13), (17), 37), 137), 173), (26), (45), (13), (26), 


etc.}). Similarly, we define the subgroup C(D) of column permutations of D to be the subgroup 
of S,, of permutations stabilizing the columns of D. 


If D, is a diagram and o € S,,, then oD, is the diagram obtained from D, by applying o to its 


entries. Evidently this is an E, and every E, is a oD, for some ø €e S,. It is clear that 
R(oD,) = oR(D,)o™' and C(aD,) = aC(D,)o™*. 


With each diagram D ( = D,) we shall associate elements of the group algebra F[S,,] as 
follows: Put 


S= ¥ o A=} gth, 


Fp = ApSp = A (sg t)ta 
aeR(D) 
treC(D) 


where sg t denotes the sign of the permutation t. Evidently, Sp + 0 and Ap # 0 in F[S,]. But 
also Fp # 0. To see this, we observe that R(D) N C(D) = 1, since an element common to these 


subgroups stabilizes every row and every column and hence fixes every element in {1, 2, 
...M}. It now follows that if o}, 0. € R(D) and ti, t) E C(D), then t,o, = t20, implies 
a '=% ‘ty and o) = o> and q) = tz. Thus the products to appearing in Fp are distinct and 
hence Fp # 0. 

Since R(PD)=pR(D-' and CleP)=eC(P\p' for p e S, and pte '=set , it follows that 
Spo = PSvP's Apo = PAP”', and Feo =eF op‘, Also if o e R(D) and t € C(D), then the definition 
(35) gives 


(36) osp =Sp= Spo,  TtAp = (88 T)Ap = Apt. 


The main properties of the elements Sp, Ap, Fp will be derived from the following 


combinatorial result. 


LEMMA 1. Let a and $ partitions such that a = p and let D, and Ex be associated 
diagrams. Suppose no two numbers appearing in the same row in D, are in the same column 
of Ep. Then a = p and E, = otDz for some o € R(D,) and te C(D,). 


Proof. The number of entries in the first row of D, is the same as or exceeds that in the first 
row of Ep. If greater, then since the number of columns of Ey is the number of entries in the first 
row of Ex, two entries of the first row of D, occur in the same column in Es, contrary to 


hypothesis. Hence both diagrams have the same number of entries in the first row. Also we 
have a column permutation zc’; of Eg so that the first row of tr’; Eg has the same entries as the 


first row of D,. Next we note that the entries in the second row of D, occur in distinct columns 


of ct’; Eg and in rows after the first. It follows that D, and 7’|E, and hence D, and Ex have the 
same number of entries in the second row and a column permutation 7’, of t'} Eg (and of Ep) 


brings these into the second row. Continuing in this way, we see that 2 = a and there exists rt’ € 
C(E,,) such that the entries of each row of t'E, and of D, are the same. Hence there is a ø € 


R(D,) such that oD, = TE, Now re C(E,) = C(t'E,) = C(aD,) = oC(D,)a~* Hence t= gr ‘a *. t 'eC(D,) 


and ot 'o"'E, = oD,.. Then E= oD, 


Now assume a > p. Then Lemma 1 implies that there exist i, j, i # j, ina row of D, and ina 
column of Ep. If z = (ij) then z e R(D,). Then, by (36), nSp, = Sp, = Spx and nAg, = -AER Hence 


Sp,Ar, = (Sp,7)Ar, = Sp, (tAg,) = —Sp Ag 
Ar Sp, = Ar (1Sp ) = (Ap 1)Sp, = -Ag Sp. 


Hence *9.4& =°= Asb.. If p is any element of S,,, then Spo, = PS0P™ and since Sev.Ae, =9= Ar Syp,' 
we have 50? "Ap = Oand Ay pSp = 0. _ Thus 
(37) Sp,FLS,JAe, = (= Ar FLS,|Sp, if a> p. 


LEMMA 2. An elementa e F{S,] satisfies the equations tao = (sgr) a for all o e R(D) and 
tE C(D) if and only if a = yFp, y € F, Fp = Ap Sp. 


Proof. We have Fpa = ApSpo = ApSp= Fp and tFp= tApSp = (sgt)ApSp = (sgt)F o, Hence any yFp, y 


€ F, satisfies the conditions. Next let ° = 2s.7» satisfy the given conditions. Then 


(38) Yo = (SB Trp 


for o € R(D) and t € C(D). In particular, y,, = y1 sg T, so if we can show that y, = 0 if is not 
of the form to, tT E C(D), o e R(D), then we shall have a = y, Fp by (35). Hence suppose 
p # ta for teC(D), ERD), Then P’ * 77S RID) teci). Then p-!D # otD and Lemma 1 implies that 
there exists a transposition ‘neR(D), xeCip"'D) = p"'C(Dp, Then m = p 'n'p where z’ is a 
transposition contained in C(D). Then p = z'pz and »=» =» = =} by (38). Hence y, = 0 and the 
proof is complete. o 

Now let x e F[S,] and consider the element Fp x Fp = Ap Sp x Ap Sp. By (36) we have 
tFpxFpo = sgtFpxFp for o € Rp, t € Cp. Hence, by Lemma 2, Fp x Fp = y Fp for y e F. In 
particular, Fp? = yFp. We proceed to show that y + 0. For this purpose, we consider the map x 
~ F px of F[S,] into itself. If y = 0, Fp? = 0 and the map x ~ Fpx is nilpotent and hence has 
trace 0. On the other hand, ifp € S,, then if we look at the matrix of x ~ px relative to a base, 
we see that the trace of this map is 0 if p + 1 and is n! ifp = 1. Since the formula (35) for Fp 
shows that the coefficient of 1 in the expression for Fp is 1, the trace of x ~ Fpx is n! 4 0 


(since char Ffn'), 

We now put ep = y! Fp. Then ep? = ep £ 0 and ep FS, ]ep = Fep. Also, if a and £ are 
distinct partitions, then *»,flS.Jee, =0 follows from (37) if D, is a diagram associated with a and 
Ep is one associated with 2. 

We recall that if e and f are idempotents of a ring A, then the additive groups hom,(Ae, Af) 
and e4f are isomorphic and the rings End ,4e and e4e are anti-isomorphic (p. 180). If we apply 
this and Theorem 5.7 to the representations of G acting on F[S,]ep , we see that this 
representation is absolutely irreducible and that if a + p, then the representations provided by 
F [S,lep, and F' [Sn lez, are inequivalent. Since the number of conjugacy classes of S, 1s p(n), 


we obtain in this way a full set of representatives of the equivalence classes of irreducible 
representations. In terms of the group algebra F[S,,] the result we have proved is 


THEOREM 5.9. If F is a field of characteristic 0 or of prime characteristic exceeding n, 
then 


F[S,] = M, (F)®M,,(F)®: OM, (F). 


The method of proof is constructive and in theory it can be used to carry out the 
decomposition of FLS,] into simple components. The determination of the n; which are the 


degrees of the irreducible representations, can be made by calculating the characters of S,, as 


defined in the next section for an arbitrary group. There is an extensive literature on the 
characters of S,,. We shall not consider any of this here. Evidently, Theorem 5.9 has the 


following consequence. 


COROLLARY. The field Q and any field 2/(p) with p > n is a splitting field for S,,. 


5.5 CHARACTERS. ORTHOGONALITY RELATIONS 


DEFINITION 5.2. Ifp is a representation of a group G acting on a vector space V/F then 
the F-valued function on G defined by 


Xp:9 > tr plg), 


where tr p(g) is the trace of the linear transformation p(g), is called the character of G 
afforded by p. If p is irreducible, then x, is called irreducible and if F =€, then x, is called a 


complex character. The degree of p is called the degree of x,. 


As we shall see in a moment, two representations of G over € are equivalent if and only if 
they have the same character. Moreover, a great deal of experience has shown that the 


characters encapsulate precisely the information on the representations that is useable for the 
applications. For these reasons, it is fair to say that the central problem of representation 
theory is that of determining the complex irreducible characters of a given group, or, more 
precisely, of developing methods for this purpose. 

We begin by listing some simple facts about characters. 


1. Equivalent representations have the same character. If, on V; is equivalent to p, on Vz, 
then there exists a bijective linear map 7 of V, onto V, such that p,(g) = np (g) }. This 
implies that trpo(g) = tro (2) and x, = Xp. 


2. Any character is a class function, that is, it is constant on every conjugacy class and 
hence it defines a map of the set of conjugacy classes into the base field. Let g, h € G. Then 
tr p(hgh~*) = trpth)p(g)pth)”* = trpig). Hence z(hgh” ') = x,(9)-. 

3. If char F = 0, then the degree ofp is y,(1). This is clear since p(l) =1,, so x,(1) = tra(1) = 
dim V. 

4. Let Ube a p(G)-invariant subspace of the space V on which p acts and let p|U and p|V/U 
be the corresponding subrepresentation and factor representation. Then . 


LAG) = Xou l9) +Xpiviulg)- 


This follows by choosing a base (u4, ... , u„) for V such that (u, ... , u,) is a base for U (and 
hence (u, į + U, ... u, + U) is a base for V/U). Then the matrices of the p(g) all have the 
reduced form (18). The result follows from this. 

5. If p, and p, are representations of G, then 


Xp,@p. = Xp,Xp, 


that is, for every g € G, #89) = %.'9%:'9), To see this, we refer to the matrix (12) for the linear 
transformation a,® a, in V; ®@ V, where a; is a linear transformation in V,. It is clear from this 


that tr (a; ® a>) = (tr aj) (tr az). Hence #r8r: = %o%on, 


For the applications we are interested primarily in complex characters. We shall now note 
some of their properties. 


6. If G is finite, then any complex character of G is a sum of mth roots of unity where m is 
the exponent of G, defined to be the least common multiple of the orders of the elements of G. 
If g € G, then g” = 1 and hence p(g)™= 1. Hence the minimum polynomial of p(g) is a factor of 
4” — 1, and so it has distinct roots that are mth roots of unity. It follows that p(g) has a matrix of 
the form 


(39) diag {@1,W ,..+,@,} 


where the œ; are mth roots of unity. Then x9) = Le. 


There are several useful consequences of this result. First we have the following 


PROPOSITION 5.3. Let p be a complex representation of degree n of a finite group G. 
Then for any g€ G 


(40) Iz, (g)| < n = deg y, 


and 


(41) 1,(g)| =n 


if and only if p(g) = œl where œ is an mth root of unity, m the exponent of G. In particular, 


if 
(42) yg) =n 


then p(g) = 1. 


Proof. We have “= ~i® an mth root of unity. Then kW) = iol < Diled = =, Moreover, equality 
holds if and only if all the œ; are on the same ray through the origin. Since they are on the unit 
circle, this holds if and only if they are equal. Hence kA) =" if and only if p(g) = @1. The last 
statement is an immediate consequence of this. © 


The fact that 49) =" implies p(g) = 1, and the obvious converse of this leads us to define 
the kernel of the character y, as the set of g such that |“) =" (the degree of p). Then ker y, = 


ker p is a normal subgroup of G. Also we define 


Z(x,) = {g€Gllz,(g)l = n}. 


Then we have shown that Z(7,) is the set of g such that p(g) = w1. It is clear that these form a 
normal subgroup of G containing ker y, and Z(y,)/ker p is isomorphic to a subgroup of the 
multiplicative group of rath roots of unity. Hence Z(y,)/ker p is cyclic. 

Another important consequence of the proof of property 6 is 


7. Let p be a complex representation of a finite group G, p* the contragredient 
representation. Then 


(43) X = Up 


(that is, %(9) = x9). g€6)., To see this, we suppose g € G and we choose a base B in V such that 
the matrix of p(g) relative to this base is (39). Then the matrix of p*(g) relative to the dual 
base B* (p. 250) is ‘(diag {@,,...,0,}) 1 diag{w;' grees o, E = diag {@,...,@,} Hence X(9) = Ye, = 14g). 


EXAMPLES 


1. Let 1 be the unit representation: V is one-dimensional and l(g) = ly, g € G. The 
character afforded by this is the unit character y;:g ~ 1 e F. 


2. Let G be finite, p the regular representation of G. To determine y, we use the base G = 
{21 = l, 82 .--, &,} for FLG]. We have x,(1) = n. On the other hand, if i > 1, then all of the 
diagonal elements of the matrix of p(g;) relative to the base G are 0 since gig; # gj. Hence, 
%p(i) = 0. Thus the character of the regular representation is given by 


(44) YA) =IGll, x9) =0 if g#1. 


3. Let G = D,„, the dihedral group of order 2n generated by r, s such that 7” = 1, s? = 1, srs”! 


= rl, If we refer to the results given in example 2, p. 261, we obtain the following character 
tables: 


n= 2v+l, Y>l 


Le: I = I l 
Ie, | 2 O 2cos2kir/n 


-) 1 (-1} 
0 0  2cos2kir/n 


In these tables, the representatives of the conjugacy classes are listed in the top row and the 
rows correspond to the irreducible representations given before. In both cases, 1 < k < v, and 1 
</]<vifn=2v+l11l<l<v-—1ifn = 2v. 


We shall derive next some fundamental orthogonality relations connecting the irreducible 
complex characters of a finite group. We consider first a more general result in the situation in 
which G is a finite group, F a splitting field for G with char FtlGl. Let {p), ..., p,} be a set of 


representatives of the (absolutely) irreducible representations of G over F and suppose p; acts 
on V, 1 <i <s. We assume also that p, is the unit representation. Let 1 < i, j < s and consider 
hom,(V;, V;). We have a representation p, of G acting on hom,{V;, V;) obtained by defining 


(45) pig)! = p,(g lpg)’ 


for / e hom,(V;, V;). It is clear that this gives a representation of G (exercise 2, p. 251). Now 
let / be any element of hom,(V;, V;) and form the element 


(46) nD =F p,(g)l =} polpo 
g g 


We have 
piht) =} pjlhg)lp.(g) ' = ¥ poloh g) 
g 


g 
= ¥ pig)lo(g” ‘h) = n(Dp:(h). 
g 


This implies that 7(/) is a homomorphism of V; regarded as F[G]-module into V, regarded as 
F{G]-module, that is, n(/) € homyg(V;, V;). Now if i +j, then V; and V; are ae and 
non-isomorphic F[G]-modules. Hence by Schur’s lemma, 


(47) » pilgipilg) =0 
g 
ifi #j and / is any element of hom,{V; V;). Next suppose i =j. Then y(/) € Endyg] V; = F1, so 
we have 
(48) > pilg)lp,(g) EFI 
g 


for any / e EndpV;. We can use (47) and (48) to derive the following result, which is due to 
Schur. 


THEOREM 5.10. Let G be a finite group, F a splitting field for G with char Ft\Gl. Let {p}, 
..., Ps} be a set of representatives of the equivalence classes of irreducible representations 
of G over F and for each i let p be a matrix representation given by p; Then char Ftdegp, 
and we have the following relations: 


D pa (gpg ')=0 if iF; 
(49) 
L pila (pË lg) = by dylGl/deg p;. 


(0"(g) = (p(g))). 


These relations are called the Schur relations. 


Proof. Let 0-1) be a base for the space V, on which p; acts so that eau” = Leia” and let 
fi, be the element of hom,(V;, V;) such that f= buu”. Then Perla” Wu” = Sapna” Vela uy? 
Then (47) implies the first set of relations (49). Also for j = i, the foregoing relations and (48) 
give 


DIA pigu = Apu, A, EF. 


Hence 


(50) 2 PG PRO) = Ôrråir 


Put t = k in these equations and sum on k. This gives 
(deg p;)Ż = Òn lG] 


which shows that char ‘#8? and 4» = ònlGl/degp;, Substituting this in (50) gives the second set of 
Schur’s relations. © 


We have %.(9) = 2sil9)-, Hence if we put / = k and t =r in (49) and sum on k and r, we obtain 
-| . . è 
Lig (g)y,g°')=0 if ij 
(51) 
2 X» (D)xy,(9') = |G]. 


Now suppose F = €. Then it is clear from the fact that pg) has a matrix of the form diag 
{@1, ..-, @,} where the œ’s are roots of unity (see property 6 above) that %.g"') = x9), Hence 
we obtain from (51) the basic orthogonality relations for irreducible complex characters: 


È tn Q)%pg)=0 if ižj 
(52) 
: 2 
> lx, Ø) = IG. 
g 
We now consider the complex vector space €% of complex valued functions on G. We have 


the usual definitions of addition and multiplication by complex numbers: If 
owe, then (p +4)(g) = olg)+ Yig) and (ay)(g)= avlo) for q e €. We define a hermitian form on €° by 


3 ( 
(53) (oly) = = (gb (9). 


Then 


| 
(lo) = ~ Slog)? > 0 
|G| ~ 
and equality holds if and only if g = 0. Hence (g|y) is positive definite. We shall now write y; 
for Kas Then the relations (52) are equivalent to 


(54) (xilx;) i dip 


These state that {y,, ..., Xs} is an orthonormal set of vectors in CO, that is, they are mutually 


orthogonal and all have length one. It is clear from this that the irreducible complex characters 
are linearly independent over €. We have observed that characters are class functions. The set 


of class functions forms a subspace cf (G) of CC whose dimensionality is the number of 
conjugacy classes. We have seen that this number is the same as the number s of irreducible 
representations over €. Hence, it is clear that the irreducible characters constitute a base for cf 


(G). 
Now let p be an arbitrary complex representation of G, y its character. We have defined the 
multiplicity m; of p; in p on p. 255. It is clear from the definition and from the fact that the 


representations are completely reducible that two representations are equivalent if and only if 
for every i = 1, 2, ..., s, the multiplicity of p; in the two representations is the same. It is clear 


also that we have the formula 
(55) X = MX, tM2X2 + +MY, 
where m; is the multiplicity of p; inp. Now, by (54), 


(56) m; = (7;|%): 


Hence the m; are determined by y and consequently complex representations are equivalent if 
and only if they have the same character. Also, by (55), we have 


(57) (x12) = Xm? 


and this has the value 1 if and only if one of the m; is 1 and the rest are 0. This shows that a 


character is irreducible if and only if it has length 1 in the hermitian metric for CC. 
Let Ci = {1}, G, ..., Cs be the conjugacy classes of G and let h} = |C]. We now write 7;; 
for y€), g € C,. Then the orthogonality relations give 


IG\d;; = © xi(9)x,(9) 
=} 2 19) 44(9) 
k=1 geC, 


=E Lik Ny Xin 


Thus we have 


(58) Li it Lin = 1G10j;. 


Let X= (xj) € M,(€) and let H = diag {/, ..., As}. Then the foregoing relations amount to the 
matrix equation 


(59) XH'X = |G|1. 


From this, one deduces 'X¥ = IGIH-*, which is equivalent to 


|G| 
(60) 5 XijXik = Tt f 


i=1 


We shall call (58) and (60) the row and the column orthogonality relations respectively for 
the characters. We remark that if we take j = k = 1 in (60) we obtain =" = IGI for m = x1) = des ze, 
This is the relation (32) that we had obtained in a more general situation by using the theory of 
algebras. 

If p is a representation of a group G acting on V/F, then p has a unique extension to a 
homomorphism of F[G] into End;-V. This maps the element 2s%9 of the group algebra into 
~7,9), We shall denote this extension by p also. Similarly, the character y, defined on G can be 
extended uniquely to a linear map xy, of F[G] into F by putting *««#). If c e cent F[G], the 
center of the group algebra, then p(c) is in the centralizer of p, so if p is absolutely irreducible, 
then p(c) = œ(c)1 where w(c) € F. If the degree ofp is n, then x,(c) = na(c) so 


| 
(61) p(c) = X01. 


Since the map c ~ p(c) is an algebra homomorphism of cent F[G] into F1, it is clear that c ~ 
(1/n)%,(c) is an algebra homomorphism. 


We have seen that if we put "= 2G, then (cj, ..., c,) is a base for cent F[G]. 
Evidently, @=4"vrtu-~-ve'# where the nj. E€ N, the set of nonnegative integers. Since cic, € 
cent F[G], we have h'ech = Cy, h € G. This gives "a= "sow aheG, Tt follows that we 
have a multiplication table of the form 

5 
(62) ëe = ) myc, 1<jk<s 
l=1 
where mj; E N for the base (c, ..., cs) of centF[G] over F. It is readily seen that m,¿; is the 
number of pairs (x, y), x € C;, y € CG, such that xy is a given z in C). 

Now suppose F =€ and let the notation be as before. We apply the representation p; to (62) 
to obtain Peeler) = Emupla), If n; = deg p; = 7,1), then we have te) = ale), Hence we have 
the character relation 


(63) (z;(c;)/nj) (1:(C,)/n;) = LM jr ZilCy)/N; 


If we use the definition of “= «ce. and the fact that characters are class functions, we obtain 
zile) = hyxd9)) for any gj € Cj, so x%(c;) = hi. Hence we have the character relation 


(64) (h;7;)/N;) (Ny Zig/ Ni) = » mM (hi Xiy/Ni) 
l=1 

or 

(64) (h;7;)) (hy Xin) = » Mju YN; Xir 


=} 


By a character table for a group G, we mean a table giving the values yij for the 
irreducible complex characters. A fundamental problem for the study of a given group G is the 
computation of its character table. As an illustration of this, we consider the following 


EXAMPLE 


We wish to determine a character table for S4. The following is a list of representatives for the 
conjugacy classes: (1, (12), (123), (1234), (12) (34)). We denote the corresponding conjugacy 
classes as (C, Cy, C3, Cy, Cs). Their cardinalities are respectively (h4, A2, h3, hy, hs) = (1, 6,8, 
6,3). The alternating group 44 is a normal subgroup of index 2 in $4, and V = {1, (12) (34), 
(13) (24), (14) (23)} is a normal subgroup of index 6 in S, (BAI, p. 261, exercise 4). If we let 
S3 denote the subgroup of S; fixing 4, then S4 is a semi-direct product of V and $3: Any element 
of S4 can be written in one and only one way as a product sv where s € S3, ve V. Ifs; € S3 and 


v; € V, then (s1v1)(s5¥>) = (8182)(s7 1v1S2V2) and sy !v1s € V. Hence sv ~ s is a homomorphism 
n of S4 onto S, with kernel V. If p is a representation of S}, then py is a representation of S4 
whose character is 7,7. 

Now S3 = D} under a map such that (12) ~ s, (123) ~r. Hence the character table for D, 
gives the following character table for $3: 


1 (12) (123) 


mii ot l 


yj! —i l 
%12 0 -Í 


We denote the characters obtained by composing this with 7 again by 7, X2, 73 Since. (12) (13) 
(24) = (1324), the part of the character table obtained from these characters is 


1 (12) (123) (1234) (129034) 
mii 1 1 E F 
xil ~-i I —İ 
20 + Ø 


N o- = 


If n; is the degree of y; 1 < i < 5, then In? = 24 gives n + n = 18. Hence ny = n; = 3, so 
the missing two irreducible characters are of degree three. Hence the last two rows of the 
character table have the form (3, a, p, y, ô) and (3, a’, f’, y', 6’). If we use the relation (60) 
withj = 1 and k = 2, 3,4, 5, we obtain 


s+ =0, B+8=0, yty'=0, ö+ð= -2 


Hence the last row is (3, —a, —$, —y, — 2 — ô). If we use (58) with i = 4 and j = 1, 2, 3, we 
obtain the relations 
3+62+884+6;4+35 = 0 


3— 6z +88 -6y +35 = 0 
6 -88 +65 = 0, 


These equations give f = 0, 6 =— 1, y = —a. The orthogonality relation (60) for j = 2, k = 4 
gives %7 =— 1. On the other hand, a = y,(12) is a sum of square roots of 1, hence real. Thus %7 
=a(— *) =—a?=—1 and a?=1, so a =+ 1, y = 71. Thus the last two rows are either (3, 1,0, 
— 1, — 1) and (3, — 1, 0,1, — 1) or (3, — 1, 0,1, — 1) and (3, 1,0, — 1, — 1). Both determinations 
give the same table except for the order of the last two rows. 


The foregoing example illustrates how the orthogonality relations plus other information 
that one can get a hold of can be used to calculate a character table. Further results that we 
shall derive presently will supply additional information useful for calculating character 
tables. We should note also that there is a substantial literature on the characters of S,, 


beginning with a classical paper published by Frobenius in 1900 that gives formulas for the 
characters of any S,,. See, for example, Weyl’s Classical Groups, p. 213. 


EXERCISES 


— 


. Determine a character table for 44. 


2. Determine a character table for the quaternion group. 


3. Let G be a subgroup of S,,, p the corresponding permutation representation of G over €, y 
its character. Show that >«z@)="@ where r is the multiplicity of the unit representation 


pı inp. Show that 2««x'#) is also the total number of fixed points in {1, 2, ..., n} for all g 
E G. 
4. Let the notations be as in exercise 3. Show that the number of orbits of Gin {1, 2, ..., n} 


is the multiplicity r ofp, inp. 

5. A permutation group G is called k-fold transitive for k = 1, 2, ... if given any two k- 
tuples (71, i>, ..., i) and (jis jo, ...,7,) of distinct is and js, there exists a g e G such that 
gi, =j 1<1<k. Let G be doubly ( = 2-fold) transitive, p the corresponding permutation 
representation. Show that p ~ p; ® p; where p; is an irreducible representation £ p4. 


6. Let 7), ..., Xş be the irreducible complex characters of G and C) = {1}, Cz, ..., C, be 
conjugacy classes of G, A; = |Cj. Put «= 2e«9 and «= LMNs) where x; = x,(g) for some 
g € C. Show that the e; are orthogonal idempotents in the center of C[G] and Ze; = 1. 


7. Show that an element g in a finite group G is conjugate to its inverse g`! if and only if 
x(g) € R for every irreducible complex character y. 


5.6 DIRECT PRODUCTS OF GROUPS. CHARACTERS OF ABELIAN 
GROUPS 


It is easy to see that if G, and G, are finite groups, then F[G, x Gz] = F[G,] @ p FIG]. If char 
F{|G| and F is a splitting field for G} and G,, this can be used to reduce the study of the 
representations over F of G, x G, to that of the components G;. In the most important case, in 
which F = €, we can obtain the results also by using characters. We shall follow this 
approach. 

Let p, i = 1, 2, be a representation of G; acting on V; over any F. If we compose the 
projection (g), 25) ~ gı with p,;, we obtain a representation p', of G) x Gz. Evidently G, c 
kerp',. Similarly, we obtain a representation p’, of G, <x G, by composing the projection (g4, 
22) ~ Z> with the representation p, of G>. We now form p'; ® p'5, which we denote as p, # p>. 
Then 


(65) (01 # P2) (91,92) = P91) ®@P2(9o) 


and hence 


(66) Lv, #991592) = Lo (9%) (G2). 


We have the canonical imbeddings g, ~ (g, 12) and gy ~ (14, 22) of G, and G, in G, x G,. Then 


(67) Loi #p2(91> 12) = Xp (91)Xp (12) = (deg p2)%,,(91). 
Similarly, 
(68) j R T h P = (deg p;)7,,(92): 


All of this has an immediate extension to direct products of more than two factors. 
Now suppose the G; are finite, F = €, and p; is an irreducible representation of G;. Then we 
have 


y F PMR (91,92)! 


1592) 


V AN > P YA )= IG, x X G,| ü 


= qo (91)%p(G2)17 
GIG, PESON ES 


1:92 


7. lea i0) 


1:92) 


| E 
T FPA baal SPAT e) 
Gy (Eel) (Zu 


=s l An a |; 


Hence p, # p> is an irreducible representation of G) < G,. Next suppose the irreducible 


~ GIG 


characters of G; are y} , ... , y, © and let p, be a representation affording y,. The 
representations p,” # p/?) of G, x G, are irreducible and (67) and (68) imply that p,{)) # p/? 
and p” # py have distinct characters and hence are inequivalent if (k’, 1) # (k, D. Hence 
we obtain in this way s}, s) inequivalent irreducible representations. Since the degree of p,” 
# p{ is the product of the degree of p,™® and the degree of p/?), the sum of the squares of the 
degrees of the p,“) # p/ is the product of the sum of the squares of the degrees of the 
irreducible representations of G, and the sum of the squares of the degrees of the irreducible 
representations of G>. This is |G,||G| = |G, x G|. It follows that the set of representations 


fp, # p} is a set of representatives of the equivalence classes of the irreducible 
representations of G, x G. This proves 


THEOREM 5.11. Let G, and G, be finite groups, {p\, ..., p.} a set of representatives of 
the equivalence classes of irreducible representations over € of G;. Then every p,® # p is 
an irreducible representation of G, x Gy and {p, # p{ is a set of representatives of the 
equivalence classes of irreducible representations of G) X Gp. 


A character y of degree one is called /inear. Evidently, such a character is irreducible and 
may be identified with the representation affording it. Thus y is a homomorphism of the given 
group into the multiplicative group of a field. Conversely, any homomorphism of a group G 
into the multiplicative group F* of a field F is a linear character of G. We recall that these 
characters have played an important role in the Galois theory of fields (BAI, p. 291). If y and 
x' are linear characters of G, then y,y, defined by (yy')(g) = x(g)y'(g) is a linear character. In 
this way the set of linear characters of G forma group with the unit character: g ~ 1 as unit and 
the inverse of y as g ~ y(gy!. 

If G is an abelian group, every irreducible complex character of G is linear. For, if p is an 
irreducible representation of G acting on V over €, then Ende, GV = €l and since G is abelian, 
Env p c EndeygV so Env p = €l. Then any subspace is p(G)-invariant and since p is 
irreducible, V is onedimensional. Hence y, is linear. Since G is abelian, every conjugacy class 
consists of a single element; hence the number of these is |G|. Then G has |G] irreducible 
complex characters. 

The last result can also be seen without using representation theory. In fact, we can easily 
determine the structure of the group of irreducible (= linear) complex characters of any finite 
abelian group. If G is any abelian group, the group of irreducible complex characters is called 
the character group of G. Now let G be finite abelian. Then G is a direct product of cyclic 
groups (BAI, p. 195): G = Gi x G x ... x G, where % = <) and g; has order e;. We may 
assume the G, are subgroups and every element of G can be written in one and only one way as 


g=91"'92"""" Where 0<k,<e,. Let x¢@, the character group of G. Then | #40" = 2") = 20) =! so 
x(g;) is an e;th root of unity. The set of these is a cyclic subgroup Z, of the multiplicative group 


c*. We now define a map of G into Z,, x Ze, x ... x Ze by 


It is clear that this is a homomorphism. The kernel is the set of y such that y(g;) =1, 1 <i<r. 
Then 2(9:"'*"'@"") = x01)" x0)" = | and y = 1. Hence (69) is a monomorphism. Moreover, the map 
is surjective. For, if (uj, ..., u,) E Ze, X ... X Ze, then ufi = 1 and hence we have a 


homomorphism of $% into Z,. sending g; ~ u;. Then we have a homomorphism x of G= G} * ... 


x G, into C* sending g; ~ u; 1 <i < r. Evidently, y is a character that is mapped into (up, ..., u) 
by (69). Hence we have an isomorphism of G with its character group 6. 


EXERCISES 


1. Show that the number of complex linear characters of a finite group is the index [G: G", 
G” the commutator group of G. 


2. Let G be a finite group, p the regular representation of G over the field F. Form the field 


F(Xg1, +++» Xom) in m-indeterminates where m = |G| and g; ~ xg; is 1-1. The determinant 


(70) det ) x,p(g) 
geG 


is called the group determinant of G over F. (The study of such determinants was one of 
the chief motivations for Frobenius’ introduction of the theory of characters.) Show that 
if F = € and G is abelian, then 


(71) d(Y s0) = || 2 va) 


geG yeG 
This result is due to R. Dedekind. 


geG 


3. Let Fab denote the category of finite abelian groups with homomorphisms as morphisms. 
We have a contravariant functor D from Fab to itself such that 4~4 and iff: A — B, then 
f:B~ Ais defined by y ~ y where y(x) = w(f(x)), x € A. Show that 07:4 A4./- is naturally 
equivalent to the identity functor (cf. p. 22). 


4. Show that if H is a subgroup of a finite abelian group G, then the subgroup of ĉ of y such 
that 77; = ly can he identified with ei. and hence its order is |G/H]. Use this to show that 


the map y ~ yy of ĉ into # is surjective. Note that this implies that any linear character 
of H can be extended to a linear character of G. 


5.7 SOME ARITHMETICAL CONSIDERATIONS 


In this section, G will be finite and all representations and characters are complex. We shall 
apply some elementary results on integral complex numbers that were given in BAI, pp. 279- 
281, to obtain important results on the degrees of the irreducible representations and on the 
characters of S,,. 


We recall that a € € is called algebraic if a is algebraic over the subfield Q, that is, a is a 
root of a non-zero polynomial with coefficients in @. The complex number a is called integral 
(or an integer) if a is a root of a monic polynomial with integer coefficients. A useful criterion 
to prove integrality is that a is integral if and only if there exists a finitely generated Z- 


submodule M of € such that 1 € M and aM c M. The subset A of € of algebraic numbers is a 
subfield and the subset / of integral complex numbers is a subring. Moreover, if a € € is a root 
of a monic polynomial in A[A] (in /[A]), then a € A (J). We showed also that a is integral if and 
only if a is algebraic and its minimum polynomial over @ has integer coefficients, and that the 
only rational numbers that are integral in € are the elements of Z. 

We have seen that the characters of G are sums of roots of unity (property 6 on p. 270). 
Since roots of unity are integral, it follows that y(g) is integral for any character y and any g € 
G. We recall the following notations that were introduced in section 5.5. 


1. Xis -+ Xs are the irreducible characters. y; is the unit character, that is, the character of 
the unit representation p4. 


2.C, = {1}, C, ..., C, are the conjugacy classes. 


3. Xj =Xlgj) where g, € C. 
4. n;= yj. This is the degree of the representation p; such that Ie te 


Soh =C: 
We recall also the following character relations (p. 276): 
5 
(64) (h;x;;/n;) (hy Xix/Ni) = 2 mM palh Xun) 


l=1 


where the mj,; € N. We shall deduce from these equations 
PROPOSITION 5.4. The complex numbers h,%;;/n; are integral. 


Proof. Fix i and put uz = hyyj/nj, 1 < k < s, ug = 1. Let M = 2044", Then (64) shows that u, M c 
M. Hence uy is integral by the criterion we noted above. 


We can now prove 
THEOREM 5.12. 1,||G\. 


Proof. We use the formula (58) for i =j to obtain lGl/n: = 2e%azale/m-, Since the Ze and %aħ/" are 
integral, so is |G\/n;. Since this is rational, it is contained in Z. Hence n,||G|. 0 


The foregoing result is due to Frobenius. We also have the theorem that every n; = 1 if G is 


abelian. Both of these results are special cases of the following more general theorem, which 
is due to Schur. 


THEOREM 5.13. 1,|[G: Z], Z the center of G. 


Proof. (Tate). Let m be a positive integer and let G,, = G x G x ... x G, m times. Let p; be an 
irreducible representation of G over € affording the character y; V the vector space on which 
p; acts. We have the representation p of G„ acting in V,, = V8... ® V, m times, such that p(g,, 
-> Zm) = PAZ) @ ... @ p (€m (See section 5.6). By iterating the result of Theorem 5.11, we 
see that p is irreducible. If c e Z, the irreducibility of p; implies that dc) =redly. nede€, 
Evidently y; is a homomorphism of Z into €*. By the definition of p we have 

plee XIXE I X D= ely 


m 


and hence for ce Z, 


P(Cy XX Cm) = Yili) VC) Ly, = Yili Cm) Ly 


It follows that the subset D of elements of G, of the form (c4, Cz, ... Cm), C; € Z, Mfes 1l is in 
the kernel of p. Evidently D is a normal subgroup of G,,, and p defines a representation ? of 
Gm/D that is irreducible. We now apply Theorem 5.12 to P to obtain »*"llGw/DI = IGAZI- *. This 
implies that (6V/»iZ)"e21ZI-* and this holds for all m. Put u = |G\/n,Z|. Then the relation shows 


that the Z-submodule M = Li-oZ Of Q is contained in z|Z/!. Since Z is noetherian, so is 2|Z/! 
and hence so is M. Thus M is finitely generated as Z-module. Since 1 € M and uM c M, it 
follows that u € Z. This proves that (7,|Z))||G|. Then n;[G : Z] as required. O 


Let F be a subfield of €. Then / N F is a subring of F called the ring of algebraic integers 
of F. The study of the arithmetic of such rings constitutes the theory of algebraic numbers. We 
shall give an introduction to this theory in Chapter 10. Now suppose F is a splitting field for 
the finite group G. Then F[G] = M, (F)® ... ® M, (F) and s is the number of conjugacy 
classes of G (see p. 280). We have s inequivalent irreducible representations p; over F and 
these remain irreducible on extension of F to €. Thus {pj¢, ..., psc} a set of representatives of 
the classes of the irreducible complex representations. If p; acts on V/F, then to compute 
X%pi(g), g € G, we choose a base for V/F and take the trace of the matrix of y,;(g) relative to 
this base. Nothing is changed if we pass to €. Thus we see that x,,¢(g) = x,,(g). This shows that 


the irreducible complex characters have values in F. We know also that these values are 
contained in /. Since any character is an integral combination of irreducible characters, this 
gives the following 


THEOREM 5.14. Let F be a subfield of € that is a splitting field for the finite group G. 
Then any complex character of G has values that are integral algebraic numbers of F. 


In section 4, we showed that Q is a splitting field for S,,. Hence we have the 


COROLLARY. The complex characters of S,, have values in Z. 


5.8 BURNSIDE’S pq’ THEOREM 


One of the earliest applications of the theory of characters was to the proof of the following 
beautiful theorem due to Burnside. 


THEOREM 5.15. Jf p and q are primes, then any group of order pq’ is solvable. 


Quite recently, John Thompson succeeded in giving a proof of this theorem that does not use 
representation theory. However, this is considerably more complicated than the original proof 
with characters, so the original proof—which we shall give here—remains a good illustration 
of the use of representation theory to obtain results on the structure of finite groups. 

We prove first the following 


LEMMA. Let y be an irreducible complex character of a finite group G, p a representation 
affording y. Suppose C is a conjugacy class of G such that (\C\, y(1)) = 1. Then for any g € C, 
either y(g) = 0 or p(g) € Cl. 


Proof. Since (|C|, 7(1)) = 1, there exist integers / and m such that /|C| + my(1) = 1. Then 


x) _ x(g) 
A, g 
(72) aD = I|C|- Al) + my(q). 


Now y(g) is an algebraic integer, and by Proposition 5.4 (p. 283), \Clzt@yx\) = Clg", where n = 
deg p = (I), is an algebraic integer. Hence y(g)/n is an algebraic integer. We recall also that 
y(g) < n and this was proved by showing that y(g) is a sum of n roots of unity (p. 270). Thus 
x(g) = @, +... + @, where the w; € W, a cyclotomic field of complex roots of unity (BAI, p. 
252). Let H = Gal W/Q. and let s e H. Then s maps roots of unity into roots of unity. Hence 
sy(g) is a sum of n roots of unity. Hence '#9 <» and Isx@VxO"<! Tt is clear that since a = y(g)/ 
y(1) is an algebraic integer, so is sa = sy(g)/y(1) and |sa| < 1. Then the norm “wl@) = Tlewsa 
satisfies |Nj@(a)| < 1. Since this is an algebraic integer and a rational number, it follows that 
INwala)| € 2°. Hence either Nyala) = 0, in which case a = 0 and y(g) = 0, or [Ny@(a)| = 1. In 
tbe latter case, la| = 1, \y(g)| = n, and Proposition 5.3 shows that p(g) = œl, œ a root of unity. 


We recall that the only abelian simple groups are the cyclic groups of prime order. Since 
this class of simple groups is rather trivial, one generally excludes it from the study of simple 
groups. We follow this convention in the following 


THEOREM 5.16. Let G be a finite (non-abelian) simple group. Then no conjugacy class 
of G has cardinality of the form p°, p a prime, a > 0. 


Proof. Suppose C is a conjugacy class of G such that |C| = p“, p prime, a > 0. Let p4, ..., p, be 


the irreducible representations of G, X1, ..., 7, the corresponding characters. We assume p} is 
the unit representation, so 7;(g) = 1 for all g. Let n; = 7,(1), the degree of p;. If ptm then the 
foregoing lemma shows that either 7g) = 0 or p{g) € €l for every g e €. Now the elements g 
€e G such that p({g) € Cl forma normal subgroup G, of G. Hence if 7g) # 0 for some g € C, 
then G; # 1. Then G; = G since G is simple. Also since G is simple and p; is not the unit 
representation for i > 1, it follows that G = p(G) if i > 1. Since p(G) is abelian, this is 
excluded. Hence if ptn and i > 1, then ye) = 0 for all g e C. We now use the orthogonality 
relation (60), which gives 


(73) ) xi (g)y(1) =} nixilg 


for g € C. Since n= 1 and y,(g) = 1, we have some n; i > 1, divisible by p. Let n, ..., n, be 
the n; i > 1, divisible by p. Then (73) gives the relation 


1+9 njyz(g)=9, gec. 


Since the n; are divisible by p and the y;(g) are algebraic integers, this implies that 1/p is an 


algebraic integer, contrary to the fact that the only rational numbers that are algebraic integers 
are the elements ofz. o 


We can now give the 


Proof of Theorem 5.15. Let |G| = pq? where p and q are primes. Let P be a Sylow p-subgroup 
of G. Since P is of prime power order, its center Z# 1. Ifz # 1 is in Z, then the centralizer C(z) 
ofz contains P, so [G : C(z)] is a power of q. Now [G:C(z)] is the cardinality of the conjugacy 
class C containing z (BAI, p. 75). Hence if [G:C(z)] > 1, then G is not simple non-abelian by 
Theorem 5.16. On the other hand, if [G: C(z)] = 1, then z is in the center of G. Then the center 
of G is not trivial, so again G is not simple non-abelian. It is now clear that unless G is cyclic 
of prime order, it contains a normal subgroup H # 1, #G. Using induction on |G| we can 
conclude that H and G/H are solvable. Then G is solvable (BAI, p. 247). o 


5.9 INDUCED MODULES 


Let G be a group, H a subgroup of finite index in G, o a representation of H acting on the 
vector space U/F. There is an important process, introduced by Frobenius, for “extending” ø to 
a representation o? of G. We recall that we have an action of G on the set G/H of left cosets of 
H in G. If G/H = {H, = H, Ah, ..., H,} and g € G, then gH; = Hg); and m(g) is a permutation 


of {l, ..., r}. The map z: g ~ a(g) is a homomorphism of G into the symmetric group S, (BAI, 
p. 72). Now put US = UW), the direct sum of r copies of U. Let {s), 55, ..., S,} be a set of 
representatives of the left cosets of H, say, H; = s,H, 1 < i < r. From now on we shall call such 
a set a (left) cross section of G relative to H. If g € G, then 


(74) gs; = Spgitti(g) 
where u;(g) € H. We can define an action of G on UF = UO by 


(75) guy, EET u,) z (0 (Hri) 11(9) Ux(g)- Mg reey O (Hirig) 1r (g))Unig)- ip) 


where the u; € U. Using the fact that z is a homomorphism,and that H acts on U, we can verify 
that (75) does indeed define an action of G on U®. Since this action is by linear 
transformations in a finite dimensional vector space, we have a representation of of G acting 


on UC. We can verify also that another choice of cross section of G relative to H defines an 
equivalent representation. All of this will become apparent without calculations by giving an 
alternative, conceptual definition of of, as we shall now do. 

Let B = F[H] the group algebra of H, and A = F[G], the group algebra of G. To be given a 
representation o of H amounts to being given a module U for B that is finite dimensional as 
vector space over F. Now B is a subalgebra of A and hence 4 is a B-B-bimodule in which the 
actions of B on A are left and right multiplications by elements of B. We can form A ® pg U, 


which is a left ,A-module in which for a,, a, E A and u € U we have 


(76) A,(a,®u) = a,a,@u. 
We shall now show that [A ® p U : F] < œ, so the A = F[G]-module A ® p U defines a 


representation of G. Moreover, we shall see that this representation is equivalent to of as 
defined above. 

As before, let {s} 55, ..., 5} be a cross section of G with respect to H. Then any element of 
G can be written in one and only one way in the form s;h, h € H. Since the elements of G form 
a base for A, it follows that A regarded as right B-module is free with (s), ..., s,) as base. It 
follows that every element of A ® g U can be written in one and only one way in the form 


(77) $1 @uy +5, @u2 + +5,@u,, 
u; € U. Hence if (u, ..., u™) is a base for U/F, then 


-ini ! mN 
(78) (s, @u?,...,8, Qu” s+** 55 ul ...,5, Qu”) 


is a base for A ® p U, so [A ® g U : F] = rn < œ. Thus the module A ® ẹ U defines a 
representation of G. 


Now let g € G. Then, by (76) and (74), we have 
g(d.5,@u;) = Lgs,@u; 
= DSrigitti(g)@u; 
(79) = È Sri @Hilg)u; 
= È Si® Hrg- 1i(G aig) 'i 


= Y'5;@O (Mai) 1i(G) Urg- 
Comparison of this with (75) shows that the map " (ti-u) ~> Eissu is an equivalence of the 
induced representation of G as defined first with the representative of G provided by A ®}U. 


The module UČ (or A @,U) is called the induced G-module U and the associated 


representation of is called the induced representation of G. As we have seen, if (uv), ..., u™) 


is a base for U/F, then (78) is a base for the induced module. Now suppose 


(80) hu® = Y a(h)u? 
j 


for A € H, so h ~ a(h) = (a;(h)) is a matrix representation of H determined by o and 


(81) Zalh) = XZ a;lh) 


is the character of o. By (79), the matrix of o%(g) relative to the base (78) is obtained by 
applying the permutation z(g) to the r rows ofn x n blocks in the matrix 


(82) diag {a(H1(g))..., 2e(u,(9))}. 


Only the non-zero diagonal blocks of the matrix thus obtained contribute to the trace. These 
blocks occur in the ith row (of blocks) if and only if z(g)i = i, which is equivalent to gs,H = 


s,H and to s7! gs; € H.It follows that the character 


(83) 105(9) = L'Z0(S; gsi) 


where the summation >' is taken over all i such that s;! gs, e H. This can be put in a better 
form by extending the definition of the function y, from H to F to a function % on G defined by 
is) = 0 if g € G- H. Using this, we obtain the formula 


(84) Jo°(9) = >. (Sr ‘gsi. 


i=] 


This can be written in another form if G is a finite group, by noting that Z(t 'gh) = %(9). This is 
clear if g € H since y, is a class function and if 9¢4, then * 'gh¢H and j,(h”‘gh) = 0 = 7,(9), Hence 


oe l , E 
AN as) = Fy Z((s;h)~'gs;h). 
heH 


Then (84) gives the formula 
(85) Zelg) = È hala ‘ga. 


EXAMPLES 


1. Let G be the dihedral group D,, of order 2n generated by r, s, such that 7” = 1, s=], 


srs! =r}. Let H =, oa representation of degree 1 of H such that r ~ œ1 where œ” = 1. We 
may take S; = 1, s) = s as representatives of the cosets of H. Then z(r) = 1, z(s) = (12), and 

FSi = Sir, 18, = Sar} 
SS; = $,l, SS = $,1. 


It follows that a matrix representation determined by o% maps 


wo 0 (0 1 (cf. p. 262 
wre Phy yy ee 


2. Let G be as in example 1, but now let H = «>. Let ø be the representation of degree 1 
such that s +» —1. We have @= 4 urtu--ur"'H, so we may take {s;} where s;=7/~!, 1 <i <n, as 
a set of coset representatives. We have 
SS) = SiS, SS, =Syan-;S if i>. 


It follows that we have a matrix representation determined by of such that 


J 0 1 O 0 J 


3. An interesting special case of induction is obtained by beginning with the unit 
representation 1 = 1y of the subgroup H acting on a one-dimensional space U, = Fu. Let {s}, 


..., Sp} be a cross section of G relative to H and suppose 9% = S«“9) where u(g) € H. The set of 
elements s; ® u, 1 <i <r, is a base for uF and, by definition of the induced representation, the 
action of g on U,® is given by 968w) = suu, On the other hand, we have the action of G on the 
set G/H of left cosets {s,H|l <i < r} given by %#)=sww«!, It is clear from this that 1° is the 
permutation representation obtained from the action of G on G/H. 

We can determine the character y,° for the representation 1°. It is clear from the formula 
91") = w®* that the trace of the matrix of 1°(g) relative to the base (S,®u, ..., S„®u) is the 


number of fixed points of z(g). This is the number of cosets aH such that g(aH) = aH. Hence 
we have the formula 


(86) 7,°(g) = |{aH|gaH = aH}|. 


We note also that the condition gaH = aH is equivalent to “!g = a!ga € H. 

We recall that the action of G in G/H is transitive and any transitive action of G on a finite 
set N = {1, 2, ...} is equivalent to the action of G on a set of left cosets relative to a subgroup 
H of index n in G (BAL, p. 75). In fact, if G acts transitively on N, then we can take H to be the 
stabilizer of any element of N. It is clear from this that the study of induced representations of 
the form 1,,°, where 1y is the unit representation of a subgroup H of finite index in G, is 


essentially the same thing as the study of transitive actions of G on finite sets. 


We shall now give two useful characterizations of induced modules in the general case in 
which H is a subgroup of G of finite index in G, U a module for H. Let A = F[G], B = F[HA], 
fsi, ... 8,} a cross section of G relative to H, and consider the induced module UC of G. We 
have US = A ®,U = @(s;@ U) and [s; ® U: F] =[U: F] =n. Hence the map x ~ s;® x of U into 
s;® Uis a linear isomorphism. Since we may take one of the s; = 1, we see that the map x ~ 1 


® x is a linear isomorphism of U onto the subspace 1 ® U = {1 ®) xix € UJ of US. If be B, 
then b( 1 @x)=b@x=b1 ®@x=1 @ bx. Hence x ~ 1 ®x is a B-isomorphism of U onto 1 ® U, 


which is a B-submodule of UC. We shall now identify U with its image 1 ® U and so regard U 
as a B-submodule of U®. It is clear also that we have UC =5,U ® ... ®s,U, a direct sum of the 


subspaces s,U. The properties we have noted give a useful internal characterization of UČ that 
we state as 


PROPOSITION 5.5. Let V be an A-module. Then V is isomorphic to UČ for some B- 
module U if and only if V contains U (strictly speaking a B-submodule isomorphic to U) 
such that V=s,U ® ... ®s,U for some cross section {s}, ..., S,} of G relative to H. 


Proof. We have shown that U® has the stated properties for the B-submodule U( = 1 ® U) and 
any cross section of G relative to H. Conversely, let V be an A-module having the stated 
properties. Then V = AU. Since s; € G, s; 1s invertible in A and hence the map x ~ sx is a 
bijection. It follows that [s;U : F] = [U : F] and since ¥ =s:U®=@®s,U, [V:F] = [G:H][U:F], We 
have the map (a, x) ~ ax of A x U into V, which is F-bilinear. Since (ab)x = a(bx) for b € B, it 
follows that we have an F-linear map 7 of A ®,U into V such that y(a ® x) = ax. Evidently, y is 
an A-homomorphism. Since V = AU, ņ is surjective and since [V : F] = [A @®}U : F] < ~, 7 is 


injective. Hence V and U® are isomorphic, A-modules. O 


The second characterization of U that we shall give is a categorical one that applies to any 
ring A and subring B. We consider the categories of modules 4-mod and B-mod. If M is an A- 
module, we can regard M as B-module. Evidently, homomorphisms of A-modules are 
homomorphisms of B-modules. In this way we obtain a functor R from the category A-mod to 
B-mod sending an A-module M into M regarded as B-module and 4-homo-morphisms into the 
same maps regarded as B-homomorphisms. We call R the restriction of scalars functor from 
A-mod to B-mod. Now let N be a given B-module. Then we can form A @pN and regard this as 


an A-module in the usual way. We have a B-homomorphism u of N into A @pN sending y € N 


into 1 ® y. We claim that the pair (A @ N, u) constitutes a universal from N to the functor R (p. 
42). This means that if M is a (left) A-module and 7 is a B-homomorphism of N into RM = M, 
then there exists a unique homomorphism à of A @pN into M (as A-module) such that 


N y R(A ® N) 
(84) R(Ñ) 
RM 


is commutative. To see this, we consider the map (a, y) ~ a(yy) of the product set A x N into 
M. Evidently this is additive in a and y and if b € B, then (ab)(ny) = a(b(ny)) = a(n(by)). 


Hence (a, y) ~ a(yy) defines a balanced product from A x N into M (p. 126). Hence we have a 
group homomorphism à of A ® N into M such that "®))= am»), It is clear that à is a 
homomorphism of A-modules and if y € N, then Rur) = RIMAS) = 1y) = ny. Hence (87) is 
commutative. Now let 7’ be any A-homomorphism of A @pN into M such that yy = n'uy = n'(1 @ 


y). Then (a 8 y) =n(all @y)) =an'(1 @ y) =a(ny). Hence "=ñ, so à is unique. 

The fact that for any B-module N, (A @pN, u) is a universal from N to the functor R implies 
that we have a functor from B-mod to A-mod sending any B-module N into the A-module A ®@,N 
and mapping any homomorphism 7 of B-modules into 1 ® y, which is a homomorphism of 4- 
modules. This functor is a left adjoint of the functor R (p. 49). 

This is applicable in particular to 4 = F[G] and B = F[H] where H is a subgroup of finite 
index in G. In this case for a given B-module U such that [U : F] < œ, we obtain the universal 
object U = A ®}U, which is finite dimensional over F and hence provides a representation of 
G. 


EXERCISE 


1. Let G be a finite group, H a subgroup, and let o be the regular representation of H. Show 
that øf is the regular representation of G. 


5.10 PROPERTIES OF INDUCTION. FROBENIUS RECIPROCITY 
THEOREM 


We shall now derive the basic properties of induced representations. We prove first 


THEOREM 5.17. Let K be a subgroup of finite index in G, H a subgroup of finite index in 
K, o a representation of H acting on U, p a representation of G acting on V, py the 


restriction of p to H. Then 
(1) His of finite index in G and of and (o%)° are equivalent. 
(2) If W is a o(H)-invariant subspace of U, then WF is a o%(G)-invariant subspace of 
UC. Moreover, if U = W, ® W» where W, is o-invariant, then US = W, ® WS. 
(3) o° @ p and (0 ® py)® are equivalent. 


Proof. (1) Write A = F[G], B = F[H], C = F[K]. It is clear that H is of finite index in G. (In 
fact, [G : H] = [G : K] [K : A].) Now of acts on © 22% (" acts on 4@AC@eU), and of acts on A 
@,U. The A-modules 4@c\©@«) and (4 @c©) @e are isomorphic (Proposition 3.7, p. 135) and A 


® C and A are isomorphic as left 4-modules (Proposition 3.2, p. 130). Hence 4@«(©®s") and A 


®,U are isomorphic as 4-modules, which means that (o*)° and of are equivalent. 


(2) If Wis o(A)-invariant subspace of U, then this is a B-submodule of U. Since A is free as 
right module over B, WC = A @,W can be identified with its image in UC = A @,U. Then 
(o|W)° is a subrepresentation of of. The second statement follows in the same way. 

(3) The two 4-modules we have to consider are ‘4@*")®r! and 4@n(U ®©, V), In the first case 
the action of g e G is OSS) = gaa) and in the second case we have 
Wa@A(x@yy)) = Ga@lx@ey), Hence the F-isomorphism of (4@04)@r" onto 4@sU@r") sending 
l@s) ~> @@(x@r¥) is not an A-isomorphism. We shall define an A-isomorphism of 4@s9(U @-") 
onto 4%») @r¥ such that 1@ulx@ry) ~ 1@,x@ry and hence $a Sr!) => WOO, For a fixed g € 
G we consider the map ©» ~ @@e)@rav 8 Ux V into 4824) @r" | Since this is F-bilinear, we have 
an F-homomorphism t, of U ®p V into \49»)®r" such that *@r’ *W@e@r9, Since G is a 
base for A/F, for any “= >%%%** we can define an F-homomorphism t, of U ®pV into 
(A@gUI@rV by % = L% ty, Then % Sr) = Eager. We now have a map 


(a, EXO FY) > ÈX Ary) 
of A x (U pV) into 4@s4)@r¥, which is F-bilinear. Moreover, if h € H, then 


TX Ary) -Thx Ahy) = (Gh® gx) @rghy — R ghx)@-ghy 
= (JQ ghx)&rghy —(9@ ghx)®@ rghy = 0. 


It follows that we have a balanced product of A as right B-module and U ®pV as left B-module. 
Hence we have a homomorphism of 48,(U ®©; V) into A@sUI@rV sending 9@alx@rv) > G@sxI@ray, 
This is an F-homomorphism and if g' e G, then #W@s*®@ry))=G9@WKX@ey) and 
(9'9@5x)@r9'Gy = g'g). Hence our map is an A-homomorphism. It is clear also that the map 
is surjective and since the two modules have the same dimensionality over F, the map is an 
isomorphism. O 


For our next result on contragredience and induction we shall make use of an involution in 
the group algebra (BAI, p. 112). We note first that if g, h € G, then (gh) ! = h ! g~ ! and (g 
Do l = g, Hence ifa ~à is the linear transformation of A = F[G] into itself such that §= 9 ', then 
ab = Bà and d= a. Thus a ~à is an involution in A. We shall call this the main involution of A and 
denote it as j. If H is a subgroup of G, then the main involution in B = F[H] is the restriction to 
B of the main involution in A. Now let H have finite index and let {s}, ..., s,} be a cross 
section. Any element of A can be written in one and only one way as 
a = ¥45,b;,, bie B. Ie = Sts, ceh, we define 


(88) (alc) = Yb,c;¢B 


and we have the map (a, c) ~ (alc) of A x A into B. We list its important properties: 
(i) (alc) is independent of the choice of the representatives s; since any other choice has the 
form Sj; h; € H. Using these replaces b; by h; tb by h; lc, Then 
(hi *bi) (h,~*e) = Byki `c; = Be, and (88) is unchanged. 
(11) (a|c) is F-bilinear. Clear. 


(iii) Ifb € B, then 


i Cis 


(ab|c) = b(aļc),  (aļcb) = (aļc)b. 
This is clear from the definition. 
(iv) Ife € A, then 
(ea\c) = (alec). 


It suffices to prove this for e = g e G. Then # = =“ where z(g) € S. and u(g) € H. 
p g g r HAS 
Then 94 = 9(23:b) = Esequstil9; and 9 ‘Seen = sanlg) so 9 '¢ s LsHdg) Cua, Hence 


(ga|c) = d biu(g)” e 


(alg te) = » bing) Cui 


Thus (ga|c) = (a|g~ tc) holds for all g € G. 
(v) (s;ls;) = djl. 


This is clear from the definition. 


We shall now establish commutativity of induction and contragredience. This is given in 


THEOREM 5.18 Jf H is a subgroup of finite index and o is a representation of H, then 
(o°)* and (o*)° are equivalent. 


Proof. Let U be the vector space on which o acts and let U* be the dual space of linear 
functions on U. If x e U and y* e U*, we write ol =». Then O*!> is F-bilinear. The 
space U* is a left B-module defined by the contragredient representation o* of H and we can 
make this a right B-module by defining »*> = 5)", b e B. Since w*ho = O*lh 'x> for h €e H, by 
definition of o*, we have ©)*hix> = &*lhx), which implies that 


(89) (y*b|x) = <y*]bx) 


for xe U, y* e U*,be B. We now introduce 


(90) (y*(alc)|x> EF. 


This is defined for x € U, a,c € A, y* e U*, and is F-bilinear in c and x for fixed a and y*. 
Moreover, ifb € B, then by property (iii) above, 


<y*(alcb)|x) = <y*(alc)b|x) = <y*(alc)|bx). 


By definition of the tensor product, this implies that we have an F-linear map of A @,U into F 
sending «®* ~ <y*(a\l>., Thus we have an element ø, ,« of the dual space (A ®gU)* such that 


(91) Pa,ys(C@X) = <y*(alc)|x). 


Now the map (a, y*) ~ 94, y» is F-bilinear and since 


(y*(ablc)|x> = <y*b(alc)|x) = <(by*) (ale)|x), 


Pab, y* = Pa, by». Hence we have a linear map y of A®gU* into (4®gU)* sending a®y* into 94, 
ys Tf ge G, WaslOx)= payg (COx)) = Part Ox) = O*lalg lx), On the other hand, 
Pole@x)= alex. Hence by (iv), Werl& = Pøs, which implies that y is an 4A- 
homomorphism. To conclude the proof, we shall show that 7 is injective. Since A®_U* and (A 
®,U)* have the same finite dimensionality over F, this will imply that 7 is an isomorphism. 
Now suppose 1(£4,@»7)=9. We can write E48») = Lis@=? and the condition "©s8:)=0 implies 
that =F =9 for all c e A, x e U and hence Eefisile) = for all c. If we take c = S; and use 
(v), we obtain z,* = 0 and hence £s®=:7 = 0. This concludes the proof. o 


Our next main objective is to derive an important reciprocity theorem due to Frobenius, 
which in its original form states that if p is an irreducible complex representation of a finite 
group and o is an irreducible complex representation of a subgroup H, then the multiplicity of p 
in øf is the same as the multiplicity of ø in the representation py of H. As we shall show 
below, this can be proved by a simple calculation with characters. We shall first give a 
conceptual proof of an extension of Frobenius’ theorem. Our proof will be based on the 
important concept of intertwining numbers for representations and on the following 
proposition. 


PROPOSITION 5.6. JfH is a subgroup of finite index in G, U a module for B = FH], and 
V a module for A = F[G], then hom,(U°, V) and hom,(U, Vy) are isomorphic as vector 
spaces over F. 


Proof. We remark that this result can be deduced from a general result on adjoint functors 
applied to the restriction functor from F[G]-mod to F[H]-mod and its left adjoint. However, 
we shall give an independent proof. As before, let u be the map of U into UF sending x + 1 ® x. 
This is a B-homomorphism. Hence if ¢ € hom,(U®, V), then Cu € hom,(U, Vy). The map ¢ ~ 


Cu is an F-linear map of hom ,(U°, V) into hom,(U, Vy). The result we proved in diagram 
(87) is that the map ¢ ~ Cu is surjective. It is also injective. For, if ¢u = 0, then ¢ (18x) = 0 for 
all x e U. Since €is an A-homomorphism, ¢(a®x) = C(a(1@x)) = alx) = 0 for all a € A, x € 
U. Hence ¢ = 0. Thus ¢ ~ Cu is an isomorphism of hom,(U®%, V) and hom,(U, Vy) as vector 
spaces over F. O 


If and z are representations of G acting on F and W respectively, then the dimensionality 
[hom ,(V,W):F] 


is called the intertwining number I(p, t) of p and t(or of V and W as F[G]-modules). If V = V; 
® ... ®V,, where the V; are p(G)-invariant subspaces, then we write p =p, ® ... ®p,, where p; 
= p|V;. Similarly, let r=, ® ... ®t, where q, = |W; and W; is t-invariant. We have the vector 
space decomposition 


(92) hom ,(V, W) = ® hom ,(V;, W;). 


This implies that 


(93) (p,t) = È 1(pj,T))- 

i,j 
Ifp and t are completely reducible and the p; and q, are irreducible, then this formula reduces 
the calculation of /(p, T) to that of the /(p;, t) where p; and q; are irreducible. If p and t are 


irreducible and inequivalent, then /(p, t) = 0 by Schur’s lemma. It is clear also that /(p, T) is 
unchanged if we replace p or t by an equivalent representation. Hence if p and t are 
equivalent, then /(p, t) = U(p, p) = [End,(V, V):F']. Hence if p and t are irreducible and F is 


algebraically closed, then 


0 if p and t are inequivalent, 
(94) (p,t)=5, . | 
| if pand tare equivalent. 


An immediate consequence of (93) and (94) is 


PROPOSITION 5.7. Jf F is algebraically closed, p is irreducible, and t is completely 
reducible, then I(p, t) = l(t, p) = multiplicity of p in t (that is, the number of irreducible 
components of t isomorphic to p). 


We can now prove the fundamental 


FROBENIUS RECIPROCITY THEOREM. Let F be an algebraically closed field, p an 
irreducible representation of G over F o an irreducible representation of a subgroup H of 


finite index in G. Assume that of and py are completely reducible. Then the multiplicity of p 
H 


in o° is the same as the multiplicity of a in py. 


Proof. By Proposition 5.6, I(of, p) = l(a, py). By Proposition 5.7, (o%, p) is the multiplicity 


ofp in of and i(a, py) is the multiplicity of ø in py. Hence these multiplicities coincide. o 


The Frobenius reciprocity theorem is usually stated for finite groups and algebraically 
closed fields whose characteristics do not divide the group order. In this situation the 
hypothesis that H is of finite index and that o% and py are completely reducible is 


automatically satisfied. The proof of this result and some of the others we gave on induced 
representations of finite groups can be made by calculations with characters. As an illustration 
of this method, we shall now prove the Frobenius reciprocity theorem for G finite and F =€ by 
using characters. Let the notation be as above and let y,, y,,etc. denote the characters of p, a 


etc. We recall that if p is irreducible and z is arbitrary, then (y,y,) = (/-l%,) is the multiplicity 
ofp int. The Frobenius reciprocity theorem is equivalent to 


(95) (X65 lXo)e = (LolXp,,)a- 


We use the formula (85) for ys Then we obtain 


ee 
(telz) = > Gil — ¥(a~ 'ga)z,(g) 


a,geG 


— 7,(g)y,(aga~') 


a,geG 


L zi a to(h)y,(aha~ ') 


heH 
On the other hand, 


l 
(XelXo,)u = (h)z,(h) 
r= È pjt 


y,(h)y,(aha~ 
à ial jpj tenet) 


since y, is a class function on G. Comparison of the two formulas gives (95). 


EXERCISES 


1. Construct a character table for G = As. Sketch: There are five conjugacy classes in G 
with representatives 1, (12) (34), (123), (12345), (13524) = (12345)? with respective 
cardinalities 1, 15, 20, 12, 12. Let yı be the character of the unit representation, v the 


permutation representation obtained from the natural action of G on {1, 2, 3, 4,5}. We 
have the following table 


1 (12) (34) (123) (12345) (13524) 


Calculation of (y,|y,,) and (y,|v;) gives yy = 71 + % where x, is irreducible. This and the 
above table give y). To obtain the three missing irreducible characters, we look at the 
induced characters obtained from linear (degree 1) characters of the subgroup 


H = {1,u = (12345), u? = (13524), uè = (14253), 
u* = (54321), (13) (54), (24) (15), (35) (21), (14) (23), (25) (34)}. 


We find that <> has index two in H, so we have the linear characters 4, and 4, on H 
where 1, = 1 and 42(v) = 1.220) = -1 ife <u>, The set 


C = f1,w = (13245), w? = (12534), w° = (14352), w* = (15423), (12) (34)} 


is a cross section of G relative to H. Using this we can compute 1,7 and 1,° as 


1 = (12)(34) (123) (12345) (13524) 


Using 02%, i = 1, 2, and (41), we obtain 2, = 7; + v3 where y3 is irreducible. 
Also we have (:%lz)=Us"lz) = @2"l4=9 | which implies that 1,° = y4 + y; where y4 and 7; 
are irreducible. The values of v1), 1 < i < 3, and Lhux*)=© give y4 = 3 = y;(1). It 
follows that we have the table 


1 (12) (34) (123) (12345) (13524) 


—| 0 0 
b c d 
þ' Cc d 


Since (54321) = (12345) ! is conjugate to (12345), %4 and y; are real (exercise 7, p. 
279). Using %4 + 75 = 4%, we obtain a + a'=~7,b+ b'=0,c+c'=1=d+d'. Using 
(Xak) = 0,4 for i = 1, 2, 3, 4, we obtain a system of equations for a, b, c, d whose 


solutions are a = — 1, b = 0, c = (1 + ¥5)/2, d = 1 — c. Either determination can be used 
to complete the table. 


. Prove Theorems 5.17 and 5.18 for complex representations of finite groups by character 
calculations. 


. Let B be a subring of a ring A, M a (left) B-module. Then homg(4, M) becomes a left A- 
module if we define (af)x = f(xa) for fe hom,(A, M), a, x e A (Proposition 3.4, p. 134). 
This is called the produced module of A determined by M. Note that the map v: f ~ fl of 
homz(4, M) into M is a B-homomorphism. Show that the pair (hom,(4, M), v) 
constitutes a universal from the restriction of scalars functor R to M (p. 137). 


. Let G be a group, H a subgroup of finite index, A = F[G], B = F[H], and let U be the B- 
module determined by a representation o of H. Then homg(4, U) is finite dimensional 
over F and defines the produced representation Žo of G. Show that %(o*) is equivalent 


to (o@)*. (This shows that in the theory of finite dimensional representations, produced 
modules are superfluous.) 


. Let G be finite, and let F be an algebraically closed field whose characteristic does not 
divide |G]. Let p be the permutation representation of G determined by the action of G on 
G/H where H is a subgroup. Use Frobenius reciprocity to show that the dimensionality 
of the space Inv p = {x|gx =x, g € G} is one. 

. (Addendum to Clifford’s theorem.) Let p be an irreducible representation of G acting on 
V/F and let Ha G. Let W be a homogeneous component of V as F[H]-module. (Recall that 
Py is completely reducible by Clifford’s theorem.) Let T be the subgroup of G 
stabilizing W. Show that (i) H c T c G and [G: T] = m, the number of homogeneous 
components of V as F[H]-module, and (ii) if y is the representation of T acting on W, 
then p = y? and y is irreducible. 


. (Mackey.) Let H be a subgroup of finite index in G and let U be an F[H]-module. Let V 
be the set of maps fG — U such that hg) = hflg). Using the usual vector space 


structure on V and an action of G defined by (gf) (x) = f(xg), fe V, g, x € G. Show that 
V becomes an F[G]-module isomorphic to US. 


5.11 FURTHER RESULTS ON INDUCED MODULES 


One of the main problems that we shall consider in this section is that of obtaining useable 
criteria for an induced representation (or character) to be irreducible. To this end we shall 
need to study the following problem: Suppose that H and K are subgroups of a group G, H of 
finite index in G, and ø is a representation of H acting on the vector space U/F. What can be 


said about the structure of of g = (o0")x, the restriction to K of the induced representation of of 
of G? If V is an F[G]-module, we denote V regarded as an F[K]-module by Vy. Thus we are 
interested in the structure of US% = (U®)g. We identify U with the F[H]-submodule 1@U of 
UC. If g € G, then gU is stabilized by the action of the subgroup 8H = {ghg '|h € H} since if x 
e U, then [ghg gx = ghx € gU. Hence we have the submodules (gU)z;; and (gU)¢ pn gy of 
Ugy and Ux p gy respectively. We remark that since £H is of finite index in G, K N £H is of 
finite index in K. Hence we can form the induced F[K]-module @Ux>#* =@Uxam)*, We shall 
obtain a decomposition of U%, into submodules of the form Uxam”. To describe this, we 
require some simple remarks on double cosets. 

We recall that G is a disjoint union of the double cosets KgH (BAI,p. 53). Moreover, KgH 
is a union of left cosets kgH, k e K, and since H is of finite index in G, the number of left 
cosets of H contained in KgH is finite. We now note that if k,, k, € K, then k,gH = k gH if and 
only if k (K N £H) = k(K N £H). For, k,gH = kogH if and only if kpg = k,gh, h € H, and this 
holds if and only if k ! kọ = ghg | h e H. This last condition is equivalent to k,(K N SH) = 
k,(K N £H). It now follows that {k;} is a cross section of K with respect to K N £H if and only 
if {k,g} is a set of representatives of the left cosets of H contained in KgH. 

We can use this result to prove the following important theorem: 


MACKEY’S DECOMPOSITION THEOREM. Let H be a subgroup of finite index in G, 
K an arbitrary subgroup of G, and let U be an F[H]-module. Identify U with the F|H]- 
submodule 1 ® U of U? and let A be the set of double cosets D = KgH. Then 


Us, _ @ DU 


DeA 


where DU = Es»9U is an F[K]-submodule of UC x and PY = (gU)Kom)* for any g € D. 


Proof. Let {s}, ..., S,+ be a cross section of G with respect to H. Then UF = 5,U@ ... ®s,U 
(Proposition 5.5, p. 290). Ifg € G, then D = KgH =s; HU... Us; H where the s; are distinct. 


Then DU = KgHU = s, U ® .... ®s; U. It follows that U"«= @x4PU. Now DU = KgU is an 
F[K]-submodule of Ux and DU contains gU, which is an F[K N &H]-submodule. We have 
seen that if {k,l < i < q] is a cross section of K with respect to K N £H, then {k,g} is a set of 
representatives of the left cosets of H contained in D = KgH. Thus we may assume that {kg} = 
{5,5 +65 Sy f> 80g = k and 


DU = kygU@:"'@k,gU. 
By Proposition 5.5, this implies that DU as F[K]-module is isomorphic to (@Usw-«*, O 


We shall derive next a theorem on the structure of the tensor product of two induced 
representations. This is 


THEOREM 5.19 (Mackey). Let H, i = 1, 2, be a subgroup of finite index in G, o; a 
representation of H, acting on U,/F, and let R be a set of representatives of the double cosets 
H gH). Put H®) = H, N £H. Then the F{G] module 


® ((U,)yo.0® (GU 2) q0.0)° 


geR 


U,;“@U;" 


ile 


Proof. By Theorem 5.17.3, o{%®02% is equivalent to (o\®(o2%);,,)°, and by Mackey’s 
decomposition, (Uy), is a direct sum of the /[H,]-submodules isomorphic to the modules 
((gU>) 1.9), g € R. Hence U,;2@U,° is F[G] isomorphic to 


® (U,;@((GUy)qu.0)")%. 


geR 
By Theorem 5.17.3, 
U, @((gUz) yu.) = (U, Jy. ®@(gU >) qa.0)™, 


Hence, by the transitivity of induction (Theorem 5.17.1), 
(U @(gU yea) = (U uea (gU )yo.0)®. 


Substituting this in the formula for U;°@U,° proves the theorem. O 


The foregoing result can be used to obtain a criterion for an induced module to be 
irreducible. We shall require also some general results on modules, some of which have been 
indicated previously in exercises (p. 251). We note first that if V, i = 1, 2, is a finite 
dimensional vector space over a field F, then we have a canonical isomorphism between V,*® 
FY, Vi” the dual space of linear functions on V, and hom,{V,, V2). This maps u*®v for u* e 


V,*, v € Vn, into the linear map [u*, v] such that [u*, v](x) = u*(x)v for x e V, (p. 165). The 
fact that we have such an isomorphism can be seen by noting that u*®v is bilinear. Hence we 
have a linear map 7 of V;*®;V into hom,{V;, V2) such that 7(u*@v) = [u*, v]. It is readily seen 
that y is surjective and since the two spaces V;*®, V, and hom,V,, V2) have the same 
dimensionality [V, : F] [V>: F], it follows that y is a linear isomorphism. 

Now suppose p; i = 1, 2, is a representation of a group G acting on V/F. As on p. 272, we 
obtain a representation p of G acting on hom,{V;, V>) if we define p(g)/ = p(g)lp,(g) 4g € G, 
l e hom,(V,, V2). The isomorphism y of V;*®, V, with hom;( V}, V2) is an equivalence of the 
representations p; * ® p, and p. To see this, let u* e V,*, v e V>. Then (p, *(g)u*@p.(g)v) is 
the map 


a (p,*(g)u*) (x)p2(g)v 
= u*(p1(g) *x)p2(g)v 
= p,(g) (u*(p,(g) *x)v) 


= p2(g)n(u* Qv)p (g) ‘x. 


Hence (p; *(g)u*@p(g)v) = p(g)y(u*@v), which implies that y is an F[G]-isomorphism of 
V,*®V, onto hom, V, V2). 

We now consider the vector space hompg)(V;, V2). Evidently, this is a subspace of hom, 
V,) and we can identify this subspace with the set of / e hom,(V,, V2) such that p(g)/ = l, g € 
G. For this condition is equivalent to p,(g)/p,(g) | = L and to p,(g)l = Ip,(g), which is the 
condition that / € homyg)(V;, V2). In general, if p is a representation of G acting on V, then the 


set of x € V such that p(g)x = x for all g € Gis denoted as Inv p. Evidently this is a subspace. 
It is clear also that x # 0 is in Inv p if and only if Fx is a onedimensional p(G)-invariant 
subspace and the restriction of p to Fx is the unit representation. Hence if p is a completely 
reducible representation of G, then [Inv p : F] is the multiplicity of the unit representation in p. 


If p; and p, are completely reducible representations, then we shall say that p, and p, are 
disjoint if they have no common irreducible constituents, that is, if there exists no irreducible 
representation that is isomorphic to a subrepresentation of both p, and p>. 


We have the following results, which are easy consequences of our definitions and of the 
F[G]-isomorphism of V;*®; V and hom,(V,, V>). 


PROPOSITION 5.8. (i) Let p, i = 1, 2, be a representation of a group G acting in Vi/F- 
Then the intertwining number 1(p;, p2) = [Inv p; * @ p, : F]. Moreover, if p) * @ po is 
completely reducible, then i(p,, p2) is the multiplicity of the unit representation in p} * ® po. 
Gi) If p1, Po, and pı * ® p, are completely reducible, then p; and p, are disjoint if and only if 
the unit representation is not a component of p; * @ p>. (iii) Let F be algebraically closed 


and let p be a completely reducible representation of G acting in V/F such that p*®p is also 
completely reducible. Then p is irreducible if and only if the unit representation has 
multiplicity one in p* ® p. 


Proof. (i) By definition, (p1, p2) = [hompg(, V2) : F]. We have seen also that homzg(V, 
V>) = Inv p where p is the representation of G acting in hom,({V,, V>) such that PON = p2l9)p:19)"', 
Since p is equivalent to p; * ® po, [Inv p : F] = [Inv p; * @ py: F]. Hence /(p;, p2) = [homps] 
(V, P): F] = [Inv p: F] = [Inv p; * @p,: F]. This proves the first statement in (i). The second 
statement is an immediate consequence of this and the fact that for completely reducible, [Inv 
p: F] is the multiplicity of the unit representation in p. (ii) If p} and p, are completely 
reducible, then (94) and Schur’s lemma show that p, and p, are disjoint if and only if /(p,, p>) 
= 0. By (i), this holds if and only if Inv p; * 8 p, = 0. If p; * ®@ p, is completely reducible, this 
holds if and only if the unit representation is not a component of p. (111) By (94) and (95), if F 
is algebraically closed and p is completely reducible, then p is irreducible if and only if /(p, p) 
= 1. By (i), this holds if and only if [Inv p*@p: F] = 1. If p*@p is completely reducible, this is 
the case if and only if the unit representation has multiplicity one inp*@p. O 


The awkward hypotheses on complete reducibility in this proposition can be removed if G 
is finite and char FtIGl, In this case we have the following criterion for irreducibility of induced 
representations. 


THEOREM 5.20. Let G be a finite group, H a subgroup, F an algebraically closed field of 
characteristic not a divisor of |G|, and let o be a representation of H acting on U/F. Then of 
is irreducible if and only if (1) o is irreducible and (2) for every g ¢ H, the representations 
of H N £H on U and on gu are disjoint. 


Proof. Evidently (1) is a necessary condition for the irreducibility of of. Now assume that ø is 
irreducible. By Proposition 5.8 (iii), øf is irreducible if and only if the unit representation has 
multiplicity 1 in («%)*@o°. Since (o%)* and (o*)° are equivalent (Theorem 5.18, p. 294), we 


can replace (o°)* by (o*)°. By Theorem 5.19, UU" = QUhon@GU now" Where g runs over 
a set of representatives of the set A of double cosets HeH. Now Viiww and (Cn »«)* mean the 
same thing: U* as module for F[H N £H] determined by the contragredient action. Hence we 
have to consider the multiplicity in every (Unan) *®W:U)nasu)" of the trivial module (giving the 
unit representation). We now note that if K is any subgroup of G and Tf is a representation of K 
on V/F, then the multiplicity of the unit representation 7, of K in t is the same as the multiplicity 
of the unit representation p, of G in r°. Since we have complete reducibility, it suffices to 
show that if z is irreducible, then p, has multiplicity 0 in rc? unless t = t}, in which case, the 
multiplicity is 1. This follows immediately from Frobenius reciprocity. We now apply this to K 
= H N 8H, g € G, and V = Unnu*@@Unom, If g € H, then these become H and U*®U. Since o 
is irreducible, the multiplicity of the unit representation in o*@o is 1; hence, the component (U* 
®U)® contributes the multiplicity 1 for the unit representation of G on U*°@U°. Hence of is 
irreducible if and only if the multiplicity of the unit representation of H N EH on 
(Unam) *®©WU)u -~u is 0 for every g é H. By Proposition 5.8, this is the case if and only if the 
representations of H N £H on U and on gU are disjoint for every g ¢ H. This completes the 
proof. o 


An interesting special case of Theorem 5.20 is obtained by taking o to be a representation 
of degree 1 of H. Then the representation of is called a monomial representation. It is clear 
from the form of the matrix representation of an induced representation given on page 288 that 
a monomial representation has a matrix representation in which all of the matrices have a 
single non-zero entry in every row and column and this occurs in the same position for all of 
the matrices. Theorem 5.20 reduces to the following criterion for irreducibility of monomial 
representations. 


COROLLARY 1 (K. Shoda). Let p be a monomial representation of a finite group G 
obtained by inducing on a degree one representation o of a subgroup H of G. Assume that 
the base field is algebraically closed of characteristic not a divisor of |G|. Then p is 


irreducible if and only if for every g e G—H there exists an h € H N £H such that o(h) + 
o(ghg '). 
Another interesting special case of Theorem 5.20 is that in which H= 6. Then £H = H for 


every g € G and the representation of H on gU is the conjugate representation £o (p. 255). 
Evidently the irreducibility criterion of Theorem 5.20 gives the following 


COROLLARY ?2. Let G, H, F, and o be as in Theorem 5.20 and assume that Ha 6G. Then of 


is irreducible if and only if o is irreducible and for every g e G — H, £c and o are 
inequivalent. 


There is an important extension of this corollary, which we shall now derive. Again, let 
H< G and let o be an irreducible representation of H acting in U/F. Let 


T(o) = {ge G|’c is equivalent to a}. 


Then 7(o) is a subgroup of G containing H called the inertial group of o. Then we have 
COROLLARY 3. Same hypothesis as Theorem 5.20, with HaG. Let y be an irreducible 
representation of the inertial group T(o) such that wy has o as an irreducible component. 


Then y® is irreducible. 


Proof. Let V be the F[T]-module on which y acts. By Clifford’s theorem applied to T = T(øo) 
and H=T, a is the only irreducible component of yy. Now consider VC, which we may assume 


contains V as a submodule, and consider the subspace gV of V? as module for F[T N £T]. Since 
H=G, Hc T AN £T and gV is a sum of irreducible F[H]-submodules isomorphic to gU. The 
representation ofH on gU is £c and since g ¢ T, this is not equivalent to ø. It follows that the 
representations of H on V and on gV are disjoint. Hence this is the case for the representations 
of T N £T on V and on gV. Then y is irreducible by Theorem 5.20. © 


A consequence of Corollary 3 that will be useful in the sequel is 


COROLLARY 4. Let G be a finite group, H=6, p an irreducible representation of G over 
an algebraically closed field F of characteristic not a divisor of |G|. Let o be an irreducible 


component of pp, T the inertial group of o. Then p = w° for some irreducible representation 
y of T. 
Proof. Since o is an irreducible component of py, there exists an irreducible component w of 
pr such that ø is an irreducible component of wy. Then y% is irreducible by Corollary 3. Since 
y is an irreducible component of py, p is an irreducible component of y? by Frobenius 
reciprocity. Hence p = y®. o 

The importance of this result is that if ™*%, then it gives a formula for p as an induced 


representation from a proper subgroup. This gives a way of establishing properties of 
irreducible representations of G by induction on |G]. 


5.12 BRAUER’ S THEOREM ON INDUCED CHARACTERS 


For the remainder of this chapter, G will be finite and the base field F will be a subfield of € 
—usually € itself. In this section, we shall prove a fundamental theorem of Brauer’s on 
induced characters. To state this, we need the following 


DEFINITION 5.3. A group G is called p-elementary if G = Z x P where Z is cyclic of 
order prime to p and P is a p-group (that is, a group of order p"). G is called elementary if it 
is p-elementary for some prime p. 


With this definition we can state 


BRAUER’S THEOREM ON INDUCED CHARACTERS. Any complex character of a 
group G is an integral linear combination of characters induced from linear characters of 
elementary subgroups of G. 


The proof we shall give of this theorem is due to D. Goldschmidt and M. Isaacs. Brauer’s 
first proof was quite complicated. A considerably simpler one was given independently by 
Brauer and Tate and by K. Asano. These proofs as well as the Goldschmidt-Isaacs proof begin 
with the following simple observations. 

Let ch(G) denote the set of Z-linear combinations of the complex irreducible characters y, 
...5 Xş Of the group G where yı is the unit character. The elements of ch(G) are called 
generalized characters. Any complex character is a generalized character and since yx; is the 
character of the tensor product of the irreducible representations affording y; and %;, xx; € 
ch(G). This implies that ch(G) is a ring, more precisely, a subring of the ring €° of €-valued 
functions on G with the usual component-wise addition and multiplication. yı is the unit of 
ch(G). If His a subgroup of G and yw is a class function on H, then we define the induced class 
function w© on G by 


| | 
(96) y"(9) = 


|H] 


where y(h) = y(h) for h € H and y(g) = 0 for g e G-— H. This is a class function on G and we 
have seen in (85) that if y is the character of a representation o of H, then y® is the character 
of o°. Ify is the character of a representation p of G, then we have 


| TTE a AG 
(97) Yx = (Wx) 
where yy is the restriction of y to H, since 9°98? = (@®px)° (Theorem 5.17.3, p. 292). Also 
transitivity of induction of representations implies the following transitivity formula 


(98) y“) =" 
if K is a subgroup of G containing H. 


Now let # be any family of subgroups of G and let ch (G) denote the set of Z-linear 
combinations of characters of the form y? where y is a complex character of a subgroup H € 
F., It is clear from (97) that ch¢(G) is an ideal in ch(G). Hence ch¢(G) = ch(G) if and only if 
11 E€ chs(G). 

The Goldschmidt-Isaacs proof of Brauer’s theorem is made in two stages: first, the proof 
that y; € chs(G) for a family of subgroups, the so-called quasi- elementary subgroups, which 


`, W(a~ ‘ga) 


acG 


is larger than the family of elementary subgroups and second, the proof of Brauer’s theorem for 
quasi-elementary subgroups. For the proof of the second part we shall prove first an extension 
by Brauer of a theorem of H. Blichfeldt that any irreducible representation of a quasi- 


elementary group is monomial. Then we shall use this to prove Brauer’s theorem for quasi- 
elementary groups. 

A group G is called p-quasi-elementary for the prime p if G contains a normal cyclic 
subgroup Z such that pt!2! and G/Z is a p-group. Evidently, if G is p-elementary, it is p-quasi- 
elementary. A group G is called guasi-elementary if it is p-quasi-elementary for some prime p. 

Let G be p-quasi-elementary, Z a normal cyclic subgroup such that pflZ| and G/Z is a p- 
group. Any subgroup H of G is p-quasi-elementary. For, H N Z is cyclic of order prime to p 
and H/(H N Z) = HZ/Z a subgroup of G/Z. Hence H/(HMZ) is a p-group. The argument shows 
that if Ptl#l, then |H/(H N Z)| = 1 so H c Z. It follows that the subgroup Z specified in the 
definition of p-quasi-elementary is unique and hence this is a characteristic subgroup of G (p. 
109). One can give a useful alternative definition of p-quasi-elementary, namely, G = AP where 
A is cyclic and normal in G and P is a p-group. For, if this is the case, then we write A = Z x W 
where PtiZl and |W] = p*. Then Z and W are unique, hence characteristic in A and hence these 
are normal subgroups of G. Evidently G/Z = WP, whichis a p-group. Conversely, suppose G is 
p-quasi-elementary and Z is as in the definition. Let P be a Sylow p-subgroup of G. Then P N 
Z = 1 and PZ = ZP is a subgroup such that |PZ] = |G|. Hence G = ZP where Z is cyclic and 
normal in G and P is a p-group. It is readily seen that a p-quasi-elementary group G is 
elementary if and only if the subgroup Z is contained in the center and if and only if a Sylow p- 
subgroup is normal. It follows that a subgroup of a p-elementary group is p-elementary. 

The first part of the proof of Brauer’s theorem is based on 


LEMMA 1 (B. Banaschewski). Let S be a non-vacuous finite set, R a subrng (rng = ring 
without unit, BAI, p. 155) of 2°. Then either R contains 1, (and hence is a subring) or there 
exists an x € S and a prime p such that f(x) is divisible by p for every fe R. 


Proof. For anyx e S let L = {fœ € R}. This is a subgroup of the additive group of Z and 
hence either Z, = Z or there exists a prime p such that p|f(x) for every fe R. If we have this for 
some x, then we have the second alternative. Now assume Z, = Z for every x € S. Then for each 
x, we can choose an f, € R such that f (x)= 1 and hence (f, — 1,) (x) = 0. Then Mes(f—1s) = 0, 
Expanding this gives an expression for l, as a polynomial in the f, with integer coefficients. 
Thus l € R. O 


We shall also require 


LEMMA 2. For any g € Gand any prime p there exists a p-quasi-elementary subgroup H 
of G such that * is not divisible by p (0, the unit representation of H). 


Proof. We can write 4%) = Z x W where ptiZi and |W| = p*. Let N be the normalizer of Z in G and 
let A be a Sylow p-subgroup of S =N;Z containing the p-group “@/Z (BAI, p. 81). Then A = 
H/Z where H is a subgroup of N containing %9). Since Z< H and rtIZl and |H/Z] is a power of p, 
H is p-quasi-elementary. We wish to show that ¢#'@) is not divisible by p. By (86), this is 
equivalent to showing that the number of cosets aH such that g(aH) = aH is not divisible by p. 


If aH satisfies this condition, then a` lga € H and hence a 'Za c H. Since H is p-quasi- 
elementary, Z is the only subgroup of H of order |Z|. Hence we have a 'Za = Z and a € N. 
Hence we have to count the number of cosets aH of H in N such that g(aH) = aH. Now 
consider the action of % on M/H. Since Zan and Z CH, z(aH) = aH ifz € Z anda e N. 
Hence Z is contained in the kernel of the action of 4% on M/H. Hence we have the action of 49) 
/Z on NH in which (gZ)(aH) = gaH. Since |‘@>/Z| = p*, every orbit of the action of %/Z and 
hence of <@ on MH has cardinality a power of p (BAI, p. 76). It follows that the number of 
non-fixed cosets aH under the action of g is divisible by p. Hence the number of fixed ones, 
x9) = [N:H] (mod p). Since H contains a Sylow p-subgroup of N, [N: H] is not divisible by p. 
Hence %«# is not divisible byp. O 


We can now complete the first part of the proof of Brauer’s theorem by proving 


THEOREM 5.21. Any complex character of G is an integral linear combination of 
characters induced from quasi-elementary subgroups. 


Proof. In the notation we introduced, this means that if O denotes the family of quasi- 
elementary subgroups of G, then chg(G) = ch(G). Since chg(G) is an ideal in ch(G), it suffices 


to show that y; € chg(G). Now let R be the subrng of chọ(G) generated by the induced 
G ; 3 : G 
characters o, of unit characters o, of quasi-elementary subgroups of G. It is clear thato, € Z 


C; hence, R c 2°. Hence, by Banaschewski’s lemma, if 4: €*e(®), then x, É Rand so for some g 


e G there exists a prime p such that y(g) = 0 (mod p) for every y € R. This contradicts Lemma 
2, which provides for any g € G and any prime p, a y € R such that y(g) is not divisible by p. 


Hence cho(G)=ch(G). Oo 


We prove next that any irreducible complex representation of a p-quasi-elementary group is 
monomial and has degree a power of p. In terms of characters this has the following form 


THEOREM 5.22 (Blichfeldt-Brauer). Let y be an irreducible character of a p-quasi- 
elementary group G. Then 


(1) the degree y(1) is a power of p, and 
(2) y =4°% for some linear character À of a subgroup of G. 


Proof. We have G = ZP where Z<G, Z is cyclic with ptiZ!, and P is a p-group. Let p be a 
representation of G affording y. 


(1) Let c be an irreducible component of p,, and T the inertial group of ø. Then Z= T and 


the index of T in G is a power of p. By Corollary 4 to Theorem 5.20, p = yf for an irreducible 
representation y of T. Hence if T + G, the result follows by induction on |G|. Thus we may 
assume T = G. Since Z is abelian, ø is of degree 1, and since Z<G = T and p is irreducible, it 
follows from Clifford’s theorem that every a € Z acts as a scalar in the space V on which p 
acts. Evidently this implies that pp is irreducible. Since P is a p-group, the degree of p, which 


is a factor of |P|, is a power ofp. 


(2) Let the degree y(1) = p”. We shall use induction on n. The result is clear if n = 0 since 
in this case y is linear. Hence we assume n > 0. If å is a linear character of G, then yA is a 
character whose degree yA(1) = y(1). Hence either y = yA or y is not a component of vA. The 
condition y = yA is equivalent to (y\yA) = 1 and since 


(zlzå) = jg Din x(g)A(g) = (zlá) 


it is equivalent also to #714) = !, Now xz is the character of p@p* and “!4)=! is equivalent to the 
fact that the multiplicity of 4 in *% is 1. Let A be the set of linear characters 4 of G such that yA = 
y. Itis clear that A is a group under multiplication. We have 


(99) w= ) AtLy 

ÅEN 
where the y' are non-linear characters. Since y(1) and the y'(1) are divisible by p and 4(1) = 1, 
it follows from (99) that |A| = 0 (mod p). It follows that there exists a 2, € A such that 2, # %1 
(the unit character) and 1,? = y,. Since J, is linear, it is a homomorphism of G into the 
multiplicative group of complex numbers, and since 1,? = y; and A; # 7, the image is the group 
of pth roots of unity. Hence, if K = ker 2, then K = G: and [G :K] =p. By (99), we have 


(190) Ikik = } Ax t+ Lk 
AeA 


and A176 = 71x. This shows that the multiplicity of the unit representation in pg®pg* is at least 
two and hence if (|)x denotes the scalar product on K, then («Z«lzinde > 2. Then Uel%ee > 4, which 
implies that px is reducible. Let o be an irreducible component of px, y the ae of o. 
Since y(1) and (l) are powers of p and y(1) < x(1), we have y(1) > py(1). Since o is an 
irreducible component of px, p is an irreducible component of o? by Frobenius reciprocity. 


Since w°(1) = pw(1) and y(1) > py(1), it follows that y = yf. The result now follows by 
induction. O 


We shall now make the passage from quasi-elementary groups to elementary ones. As 
before, let G = ZP where Z is a cyclic normal subgroup of G of order prime to p and P is a p- 
group. Let W= CAP) = Z N CoP), H = WP. Then H is a p-elementary subgroup of G with P 


as normal Sylow p-subgroup. We shall need the following 


LEMMA. Let A be a linear character of G such that H C ker 1. Then 4 = y,, the unit 
character of G. 


Proof. It suffices to show that Z c ker 4. Let K = Z N ker å. Let b € Z, d e P. Then dA(b- '!dbd- 
') = 1 since 4 is a homomorphism. Hence d(bK)d- ! = bK and so every coset bK of K in Z is 
stabilized under the conjugations by the elements of P. Thus we have an action of P by 


conjugations on the coset bK. Since |P| is a power ofp and |bK| = |K| is prime to p, we have a 
fixed point under this action. Thus bK N CAP) # Ø, which implies that 2(b) = 1. Since b was 


arbitrary in Z, we have Z c ker à. O 


We can now prove 


THEOREM 5.23. Any irreducible character of a quasi-elementary group is an integral 
linear combination of characters induced from linear characters of elementary subgroups. 


Proof. Let y be a complex irreducible character of a p-quasi-elementary group G. If y(1) > 1, 
the result follows from Theorem 5.22 by induction on |G|. Hence assume y(1) = 1, that is, y is 
linear. Let H be the p-elementary subgroup WP defined above and let 7 = yy. By the Frobenius 
reciprocity theorem the multiplicity of y in 7° is 1. Now let y' be a linear component of 7°. 
Again, by Frobenius reciprocity, 7 is a constituent of y’,,. Hence y'y = 4 = yy. Then y" =% ! 
is a linear character on G such that H c ker y". Hence by the lemma, y” is the unit character on 
G. Then y'= y. Thus 7° = y + 6 where @ is a sum of characters of degree > 1. Since the 
induction on |G| implies that 0 is an integral linear combination of characters induced from 
linear characters of elementary subgroups and since y is linear on the elementary subgroup H, 
the required result follows for y=°-6. o 

Evidently Brauer’s theorem is an immediate consequence of Theorems 5.22 and 5.23 and 
transitivity of induction. 


We shall now derive an important consequence of the theorem: a characterization of 
generalized characters among the class functions. Let cf(G) denote the set of complex class 


functions on G. Evidently cf(G) is a subring of €f containing ch(G). We know also that cf(G) is 
the C-vector space spanned by the irreducible characters %4, ..., 7, and we have defined y? for 
a class function on a subgroup H by (96). Since the maps y ~ yy and y ~ yw are linear, it is 
clear that formula (97), y%y = (wz,,)°, is valid also for class functions. We can use this to 
establish 


BRAUER’S CHARACTERIZATION OF GENERALIZED CHARACTERS. 4 class 
function @ on G is a generalized character if and only if Qy is a generalized character for 


every elementary subgroup H of G. 


Proof. Let ch(G)' be the set of class functions ø of G satisfying the stated condition. It is clear 
that ch(G)' is a subring of cf(G) containing ch(G). Now let w be a character on some 


elementary subgroup H of G and let y € ch(G)'. Then yy € ch(H), so yyy € ch(H) and (wy) 
e ch(G). Then ¥“% = zw)" ech(G), Since any element of ch(G) is an integral linear combination of 
characters of the form yw’, y a character on some elementary subgroup, this implies that ch(G) 


is an ideal in ch(G)'. Since y; € ch(G), we have ch(G)' = ch(G), which is equivalent to the 
statement of the criterion. O 


We remark that Brauer’s characterization also gives a characterization of Ihe irreducible 
characters, since y € ch(G) is an irreducible character if and only if (y\y) = 1 and y(1) > 0. 
This is clear since y € ch(G) if and only if x= Lina, n; € Z, and &lx) = £»? = 1 implies that all n; 
= 0 except one that has the value + 1. The condition y(1) > 0 then gives y = x; for some j. 


EXERCISES 


1. A group G is called an M-group if every irreducible complex representation is 
monomial. Note that Theorem 5.22.2 states that every quasi-elementary group is an M- 
group. Show that the direct product of M-groups is an M-group. Show that any nilpotent 
group is an M-group (see BAI, pp. 250-251, exercises 4—11). 


2. (Taketa.) Show that every M-group is solvable. (Sketch of proof: If the result is false, 
then there is a minimal counterexample: a non-solvable M-group G with |G| minimal. 
Since any homomorphic image of an M-group is an M-group, if 4-6 and A + G, then G/A 
is solvable. Let A and B be minimal normal subgroups of G. If A + B, then 4 N B = 1 and 
G is isomorphic to a subgroup of G/A x G/B and so is solvable by the minimality of G. 
Hence A = B and there exists a unique minimal normal subgroup A of G. There exist 
irreducible representations p of G such that ker r # 4. Let p be one of minimal degree and 


let p = of where ø is a representation of degree 1 of a subgroup H. Put p’ = cf where 
g is the unit representation of H. Show that ker p' is an abelian normal subgroup # 1 of 
G. Then G is solvable contrary to hypothesis.) 


The following exercises sketch a proof of an important theorem on induced characters due 
to Artin: 


THEOREM 5.24. Lety be a complex character of G that is rational valued. (y(g) € Q for 
all g € G.) Then 


a az _y G 


where x, is the unit character, the a, € Z, and the summation is taken over the cyclic 
subgroups Z of G. 

3. Let y be a rational valued character on G and let g, h e G satisfy %9 =<. Show that y(g) 

= y(h). (Hint: If {9 |= m, then h = g* where (k, m) = 1. Let A?” be the cyclotomic field 

of mth roots of unity over Q, so A” = Q(e) where € is a primitive mth root of unity. 


Show that there exists an automorphism o of A‘”/@ such that oe = e* and that if g ~ diag 
{Ep En... Enp & E Ais a matrix representation, then h = g* ~ diag {OE}, Ep, «++ 
o¢,,}. Hence conclude that y(e) = 7(h).) 


4. Let y be as in exercise 3. Define an equivalence relation = in G by g = h if “> and ¿h> are 
conjugate in G (“> = ata ! for some a € G). Let Dj, Ds, ..., D, be the distinct 
equivalence classes determined by =. Note that these are unions of conjugacy classes 
and we may assume D} = {1}. Let g; € D; and let {a| = n; N(“), the normalizer of ‘>. 
Show that 


(102) |D;| = [G:N(<g;>)]o(n)) 
(g the Euler g-function). Let ®; be the characteristic function of D, so ®{g) = 1 if g € 
D; and ®(g) = 0 otherwise. Prove by induction on n; that 


(103) IN(G = Lani yt 


for a; € Z where a; = 0 unless “4” is conjugate to a subgroup of w. 


5. Note that y is an integral linear combination of the functions ®;. Use this and (103) to 
prove Artin’s theorem. 


5.13 BRAUER’S THEOREM ON SPLITTING FIELDS 


In this section, we shall prove that if m is the exponent of a finite group G, that is, the least 
common multiple of the orders of its elements, then the cyclotomic field A of mth roots of 
unity over Q is a splitting field for G. This result was conjectured by Maschke around 1900, 
but was not proved in complete generality until 1945 when a proof was given by Brauer, based 
on his theory of modular representations. Subsequently, Brauer discovered his theorem on 
induced characters and observed that the splitting field theorem is an easy consequence. We 
shall follow this approach here. 

We recall that a representation p of G acting on V/F is called absolutely irreducible if the 
extension representation p% is irreducible for any field K/F (p. 263). The field F is a splitting 
field for G if every irreducible representation of G over F is absolutely irreducible. We have 
proved (Theorem 5.8, p. 264) that F is a splitting field for Gif and only if F[G] is a direct sum 
of matrix algebras M,(F). 


We now suppose that F is a subfield of €. Then we can give an alternative, more intuitive 
definition of splitting fields. For, as we shall show, F is a splitting field for G if and only if for 
any representation p acting on a vector space V/C, there exists a base for V such that for the 
corresponding matrix representation, the matrices have all of their entries in F. Since we have 


complete reducibility of the representations, this holds if and only if it holds for the irreducible 
representations. It is clear also that the condition can be formulated completely in matrix terms: 
Any complex matrix representation is similar to a matrix representation over F. Now suppose 
F is a splitting field. Then F[G] = M, (F)® ... ® M,, (F) and C[G] = F[G]l_e = M,, (€)® sae @ 
M, (©). If Z, is a minimal left ideal in M, (P), then this is an irreducible module for F[G] and 
{Z]l <j < s} is a set of representatives for the irreducible /[G]-modules. The degree of the 
representation afforded by J; is n; and |G| = zn/. Now Ji¢ is a left ideal contained in the simple 
component M,,(€). Since ie : E] =n, Le is a minimal left ideal in M,,(©) and hence the Lic, | 
<j < s, constitute a set of representatives of the irreducible modules for €[G]. Since any base 
for I/F is a base for Ji¢/€, it is clear that the irreducible complex matrix representations are 


similar to matrix representations over F. Conversely, suppose this condition holds for a field 
F. Then every irreducible complex representation has the form o¢ where o is a representation 
of G over F. Let p, ..., Ps be a set of representatives of the irreducible representations of G, n; 


the degree of p;, and choose a representation g; of G over F such that oj¢ = p;. Then the o; are 
inequivalent and irreducible, and the relation |C! En? implies that F[G] = M, (ŒF) ® ... ® 
M, (F) (see p. 259). Hence F is a splitting field for G. 

Let y and y’ he characters of inequivalent irreducible representations p and p' of G over F. 
If FIG] = Mn, (4,)8'"®Mq(4,) where the A; are division algebras, then we may suppose that p acts 
on a minimal left ideal 7 c M, (Aj) and p' acts on a minimal left ideal 7' of M,, (Az). When we 
pass to ELG] = F[G]ec, Mm (Ape and M,,,(A)e split as direct sums of simple components of €[G] 
and these ideals in €[G] have no common simple components. It follows that if %1, 7, .... Xs 
are the complex irreducible characters, then =>" and x =Xmix, where the m; and m'; are 
nonnegative integers and for every i, either m; = 0 or m'; = 0. Thusi») = Em? >0 and (y\v’) = 0. 
Moreover, y is an irreducible complex character if and only if (yy) = 1, in which case, the 
complex irreducible representation whose character is y is the extension pe. It is also clear 
that F is a splitting field for G if and only if for every irreducible complex character y; there 
exists a representation p; of G over F such that %», = %. 


We can now prove 


THEOREM 5.25. Jf m is the exponent of G, then the cyclotomic field A%/@ of the mth 
roots of unity is a splitting field for G. 


Proof. Let x be an irreducible complex character of G. By Brauer’s theorem on induced 
characters, y is an integral linear combination of characters of the form A? where 4 is a linear 
character of a subgroup H of G. Now / is a homomorphism of H into the multiplicative group € 
* of complex numbers and if h € H, then h” = 1 so A(h)” = 1. Hence A(h) e A). It is clear 
from the definition of induced representations that the representation affording 1% has a 
representation by matrices with entries in A“. Thus 4° is the character of a representation of 


G over A0”, It follows that = ¥1%9; where the k, € Zand 9, ..., Ø, are the characters of the 
inequivalent irreducible representations of G over A). Then ! = &%!2)= Ekolo) and since 


every (9;lp;) is a positive integer, all of the 4; but one are zero and the non-zero one is 1. Hence 


y is a character of an irreducible representation of G over A). This implies that A” is a 
splitting field. o 


5.14 THE SCHUR INDEX 


In this section, we shall study relations between irreducible representations of a finite group G 
over C and over a subfield F of ©. We use the notations of the previous section: 
CLG] = Ma(C)@-@M(C), FLG] = Mu (414 OMAA) where the A; are division algebras. We write 
also A; = Mm, (Aj). 

Let y be an irreducible complex character of G, p a representation of G over € affording y. 
We may assume that p is the restriction of the regular representation of G to a minimal left 
ideal V of one of the simple components M,, (©). Then V is contained in exactly one of the Aie. 
If V c Ag then AV £ 0, but A; V = 0 for every i' + i. This property is independent of the choice 
of the representation p affording v.We shall say that A; is the simple component of F[G] 
belonging to y. We can associate also a subfield of C/F with y, namely, the subfield over F 
generated by the complex numbers y(g), g € G. We denote this as F(y). We have the following 


PROPOSITION 5.9. Let y be an irreducible complex character of the finite group G, F a 
subfield of €, and let A = A; be the simple component of F(G] belonging to x. Then F(x) is 
isomorphic over F to the center of A. 


Proof. As indicated, we may take the representation p affording y to be the restriction of the 
regular representation to a minimal left ideal V c Ae. We can regard p as a representation of € 
[G] and identify F[G] with an F-subalgebra of C[G]. Since p is irreducible, it follows from 
Schur’s lemma that p(cent€[G]) = C1. We recall that the centers of F[G] and C[G] are spanned 


by the elements £c. where C}, ..., Cy are the conjugacy classes of G. If g e C; then Lx«aga™' is 


a non-zero multiple of 2#«.%. Hence the centers are spanned by the elements ‘= Essaga™', We 
have p(cg) = yl, y € €, and hence tr p(cg) = ny, where n = x(1), the degree of p. On the other 


hand, '* (6) = Dectre(aga”') = IGix9). Hence 


(104) p(c,) = —xz(g)I. 


We now restrict the C-algebra homomorphism p of €[G] to F[G]. This gives an F-algebra 
homomorphism of F[G], which maps cent F[G] onto the set of F-linear combinations of the 


elements y(g) 1. Since p maps every simple component A; # A into 0, we have a 


homomorphism of cent A onto F(y) 1. Since cent A is a field, we have an isomorphism of cent 
A onto F(y). o 


We shall now say that a complex character y of G is realizable over the subfield F of € if y 
is the character of a representation o of G over F. Since a representation is determined up to 
equivalence by its character, it is clear that y is realizable over F if and only if p is equivalent 
to og for any representation p affording y. Also, as we noted before, this is the case if and only 
if there exists a base for the space V on which p acts such that the entries of the matrices of the 
p(g), g € G, are all in F. Evidently, if y is realizable over F, then F contains every y(g), g € 
G. 

We shall now show that ify is an irreducible complex character of G and F is a subfield of 
€ containing all the y(g), then there exists a positive integer d such that the character dy (of the 
direct sum of d copies of the representation p affording y) is realizable over F. The minimum 
such d is called the Schur index of y over F. We prove the existence of the Schur index and 
give a structural description of this integer in the following 


THEOREM 5.26. Let y be an irreducible complex character of G, F a subfield of € 
containing all of the x(g), g € G, and let A= M,(A), where A is a division algebra, be the 


simple component of FG] belonging to y. Then [A : F| = Æ and the character dy is 
realizable over F. Moreover, if d' is any positive integer such that d'y is realizable over F, 
then d\d' and hence d is the Schur index of y over F. 


Proof. Since F = F(y), A is central simple by Proposition 5.9. Hence A is central, [A : F] = d 
(p. 222), and [A : F] = mœ. Any two minimal left ideals of A are isomorphic as 4-modules, 
hence as vector spaces over F, and A = /}®...®J,, where J;, 1 <j <m, is a minimal left ideal. 
Hence [J : F] = md? and [Iç : €] = md’. Since A is central simple, Ae is simple and hence this 
is one of the simple components of C[G]. Since [Aç : €] = md”, we have Ae = M,,,(€) and the 
calculation we gave for F shows that if V is a minimal left ideal of Ag, then [V : €] = md. We 
have seen that the representation p affording y can be taken to be the restriction of the regular 
representation to a minimal left ideal contained in Ae. We may take this to be V. Now lç is a 
left ideal in Ap, and [Iç : €] = md? while [V : €] = md. Hence Iç is a direct sum of d left ideals 
isomorphic to V. This implies that the direct sum of d complex irreducible representations 
equivalent to p is equivalent to t¢ where zt is the representation of G acting on J/F. Hence dy is 
realizable over F. Now suppose d' 1s a positive integer such that d'y is realizable over F. Then 
the €[G]-module V’, a direct sum of d' copies of V, is isomorphic to a module J ‘¢ where I' is 
a module for F[G]. Now V) is annihilated by every simple component of F[G] except A and 
hence /’ is annihilated by every simple component of F[G] except A. It follows that r = 1 
where J is a minimal left ideal of A. Then V“’ = fe = (Uc) = (V" = V“, Hence d' = dh. Evidently 
this implies that dis the Schur index. o 


The foregoing result shows that the Schur index of y is 1 if and only if A = F and A = 
M„(F). Hence y is realizable over F = F(z) if and only if A= M„(F). 


Now let E be a subfield of € containing F = F(y) and consider E[G]. Since the simple 
component A of F[G] belonging to y is central simple, A; is a simple component of E[G]. 
Evidently this is the simple component of E[G] belonging to y. Hence y is realizable over EF if 
and only if A; = M,(£), that is, E is a splitting field over F of A (p. 220). This is the case if and 
only if E is a splitting field for the division algebra A such that A = M,,(A). The basic criterion 
for this was given in Theorem 4.7 (p. 218). According to this result, a finite dimensional 
extension field E/F is a splitting field for a finite dimensional central division algebra A/F if 
and only if E is isomorphic to a subalgebra E’ of a matrix algebra M(A) such that the 
centralizer of E' in M,(A) is E', and if this is the case then [E : F] = rd where [A : F] = œ. This 
result and Theorem 5.26 give the following 


THEOREM 5.27. Let the notations be as in Theorem 5.26 and let E be a finite 
dimensional extension field of F contained in €. Then y is realizable over E if and only if E 
is isomorphic to a subalgebra E' of the matrix algebra M,(A) such that Cua yE’) = E'. 


Moreover, in this case [E : F] = rd where d is the Schur index of y over F. 


5.15 FROBENIUS GROUPS 


We shall conclude this chapter by applying the theory of characters to derive an important 
theorem of Frobenius on finite groups. Frobenius’ theorem can be viewed in two different 
ways: first as a theorem on transitive permutation groups and second as a theorem on abstract 
groups. Also, as we shall show, the result is related to the study of fixed-point-free 
automorphisms of finite groups. Finally, we shall consider an example of a Frobenius group 
whose character analysis leads to a variant of a classical proof of the quadratic reciprocity 
law of number theory. 

First, let G be a permutation group of the set N = {1, 2, ..., n}, n > 1, such that (1) G is 
transitive; (2) For any i, the stabilizer Stab i = {g € Glgi =i} # 1; and (3) No element of G 
except 1 fixes more than one element of {1, ..., n}. Let H; = Stab i. Then condition (3) is 
equivalent to H; N H; = 1 if i #7. Also any two of the subgroups of H, H; are conjugate: H, = 
8H, = gH g !. For, there exists a g € G such that gi = j. Then Stab j = Stab gi = g(Stab 7)g !. It 
is clear also that if g ¢ H; then gi = j 4 i and £H, = Hj + H;. We remark also that since n > 2, 
H,#G. 

We now introduce the following 


DEFINITION 5.4. A finite group G is called a Frobenius group if G contains a subgroup H 
such that (i)! FĦ F6, and (ii) for any g € G— H, H N £H = 1. The subgroup H is called a 


Frobenius complement in G. 


It is clear from the foregoing remarks that if G is a permutation group satisfying conditions 
(1), (2), and (3) above and n > 2, then any one of the subgroups H = H, satisfies the conditions 


of Definition 5.4. Hence G is a Frobenius group with H a Frobenius complement in G. 
Conversely, suppose G is Frobenius with Frobenius complement H, and consider the action of 
G on the set G/H of left cosets of H. We know that this action is transitive. Moreover, a(gH) = 
gH if and only if a € £H, so Stab(gH) = £H. Then the kernel of the action is [\s"# = 1. Hence we 
can identify G with the corresponding group of permutations of G/H. It is clear that condition 
(2) holds and condition (3) is equivalent to af #bH=*H #'H, This follows from (ii) in the 
definition of a Frobenius group. 

Thus we see that the concept of a Frobenius group is the abstract version of the permutation 
group Situation we considered first. We can now state 


FROBENIUS’ THEOREM. Let G be a Frobenius group with Frobenius complement H. 
Put 


(105) K={}(J(G- 9H 
geG 


Then K is a normal subgroup of G, G= KH, and KOA H= 1. 


If we adopt the permutation group point of view, we see that K can be described as the 
union of {1} and the set of transformations that fix noi € N. 


For the proof we require the following 


LEMMA. Let G be a Frobenius group with Frobenius complement H and let 9 be a class 
function on H such that (1) = 0. Then (9°), = @. 


Proof. By definition 


| 
(106) oĉlg) == F ġlaga™!). 


Then g°(1) = 0 since g(1) = 0. Now let h € H, h #1, and let a e G -— H. Then aha`! ¢ H and 
@(aha~') = 0. Hence 
G l : -1 l -1 
o°(h)=— }, ġlaha™')= — Y glaha™') = g(h). 
\H| acH H| acH 


Thus (9°), =o. O 


We can now give the 


Proof of Frobenius’ theorem. We note first that the definition of Frobenius complement H 
implies that distinct conjugates 1H and ®2H have only the unit element in common and the 
normalizer No(H) = {g&H = H} = H. Hence |{8H}| = [G : N(M] = [G : H]. Then 
lUseo*H! = [G:H]IH|-[G:H]+1 and so, by (105), 


(107) IK| = [G:H]. 


Now let w be a complex irreducible character of H different from the unit character y, and put 
o = y — wW1)y. Then ọ is a generalized character of H satisfying g(1) = 0. Hence, by the 
lemma, of is a generalized character of G such that (Øf) y = o. Then, by Frobenius reciprocity, 


lotlot) = (POO wn = (PlO)n = 1+ WL)? 
(9° 174)g a (olYi)y ce: —y(1) 


for yı, the unit character of G. Now put y* = of + w(1)y,. Then y* is a generalized character 
of G and 


(W*|W*)¢ = lotlot) +W). Ae +41)? 
=1+4(1}-W4(1} +41} =1. 
Also 


Wn = (a +Y) Ge = 0+4 =Y. 


Since y* is a generalized character of G satisfying (y*|y*)¢ = 1 and y*¥(1) = y(1) > 0, y*¥ is a 
complex irreducible character of G. Thus for each irreducible complex character w # y of H, 
we have defined an irreducible complex character y* of G that is an extension of y. If p* is a 
representation of G affording y*, then ker y* = {g € G | w*(g) = w*(1)} = ker p* (p. 271). 
Now put 


(108) K* = f) ker y* 
Ws 


where the intersection is taken over the irreducible characters w of H, y # yı. Evidently K* is 


a normal subgroup of G. Let k + 1 be in K. Then k ¢ £H for any g € G and y%(k) = 0, by 
definition (106). Hence w*(k) = y*(1). Thus K c K*. Next let h e H N K*. Then y(h) = y*(h) 
= y*(1) = w1). Thus A is in the kernel of every irreducible character of H. Then p(h) = 1 for 
the regular representation p of H and hence h = 1. Hence H  K* = 1. Thus we have 
K°aG, Ko K*, HY K*=1, and |G| = |A|[G: H] = |HI|K|. Hence HK* is a subgroup of G and 


HK*/K* = Hj(H ^ K*), so |ĦK*| = IHIIK*VIH ^ K*| = |H|IK*] is a factor of |G| = |H| |K]. Hence |K*||K| and 
since K* > K, K = K*. We now have K = K*aG. Ho K =1 and |K| = [G : H]. Then |KA] = |K||A] = 
|G| and hence G = KH. This completes the proof. o 

The subgroup K is called the Frobenius kernel of the Frobenius group G with complement 
H. We note that no proof of Frobenius’ theorem without characters is known. 

Now let A be a group of automorphisms of a finite group G. We shall say that A is fixed- 
point-free if no a # 1 in A fixes any g # 1 of G. We shall now relate the study of pairs (G, A) 
where G is a finite group and A is a fixed-point-free group of automorphisms #1 of G to the 
study of Frobenius groups. We recall the definition of the holomorph Hol G of a group G as the 
group of transformations Gz Aut G where Gz is the set of left multiplications gz : x ~ gx (BAI, 
p. 63). If a e Aut G, then ag; = (ag);a. Hence if g, € G, a; € Aut G, then 
(91% )(921%2) = Wi% 92222, Aut G and Gz are subgroups of Hol G, G; is normal, G; N Aut G= 1, 
and Hol G = G; Aut G. It is clear that if A is a subgroup of Aut G, then G;A is a subgroup of the 
holomorph. Now suppose 4 # 1 and 4 is fixed-point-free. If g # 1 is in G, then 
“A = {(gxg)" "| xe A}, Hence *AnA = 1 and "An A= 1 for any u € GrA -— A. Thus G,A is a Frobenius 
group with Frobenius complement A. It is clear also that Gz is the Frobenius kernel. 

Conversely, suppose that G is a Frobenius group with Frobenius complement H and let K 
be the Frobenius kernel. The group H acts on K by conjugation and these maps are 
automorphisms of K. Suppose h € H and k € K satisfies ”k = k. Then kh = hk and “h = h € H. 
It follows that either h = 1 or k = 1. This implies that the homomorphism of H into Aut K 
sending h into the map x ~ hxh~!is a monomorphism, so H can be identified with a subgroup 
of Aut K. Moreover, this group of automorphisms is fixed-point-free. 

A result that had been conjectured for a long time and was proved by Thompson is that the 
Frobenius kernel of a Frobenius group is nilpotent. The foregoing considerations show that this 
is equivalent to the following fact: A finite group having a group of automorphisms # 1 that is 
fixed-point-free is nilpotent. For a proof of this theorem, see page 138 of Characters of Finite 
Groups by W. Feit (see References). 


EXAMPLE 


Let B be the subgroup of GSL,(F), F = Z/(p), p an odd prime, consisting of the matrices of the 
form 


109 ( i 
( ) h a`! 


Evidently, |B| = p(p — 1). B contains subgroups K and D where K is the set of matrices (109) 
witha = 1 and D is the set of diagonal matrices. We have K=8, B = DK, D N K= 1. Moreover, 


a OSNA O\fa! O l () 
(110) 7 )- ii | 
0 a b l 0 a RF L 


and 


1 O\/a 0 | 0 a () 
(111) i = 7 ies Bi 
p iii a Je, ‘ Pie 'b a ) 


Put G = B/{1, — 1} and let 4 denote the image in G of a subset A of B under the canonical 
homomorphism of B onto G. We may identify K with R. Since the matrix (111) is contained in 
D if and only if a = 1, it follows that G is a Frobenius group with 5 as a complement and K as 
the kernel. 

We wish to obtain a character table for G. Let v be a generator of the cyclic multiplicative 
group F* of the field F. Then the elements of 5 are diag{v.v"',, 1 <i < (p — 1)/2. Since elements in 
a complement in a Frobenius group are conjugate in the group if and only if they are conjugate 
in the complement, the (p — 1)/2 elements of 6 determine (p — 1 )/2 distinct conjugacy classes. 
The class 1 has a single element and the classes of the elements diag {v.»~'}, 1 <i < (p — 3)/2, have 
cardinality p. Formulas (110) and (111) imply that we have two more conjugacy classes with 


representatives 
( | 0 | | | 
Ti y L 


(since v is not a square in F*). The number of elements in these classes is (p — 1)/2. Altogether 
we have (p — 1)/2 + 2 = (p + 3)/2 conjugacy classes and hence we have this many irreducible 
complex characters. 

Since ð is cyclic of order (p — 1)/2, it has (p— 1)/2 linear characters. Using the 
homomorphism of G onto G/K = b, we obtain (p — 1)/2 linear characters for G, which give the 


following table: 
3 ( 0 ( i) H + car YB 
al (ea) | fo wh sise- 
hi | 1 l | | 
h2 l | | w a) = e$"iip- 1 
Xip-1)/2 l | | | wy? 32 


Since the conjugacy class of diag{v.v"), has cardinality p, the orthogonality relation (60) implies 


that the remaining entries in the columns headed by this class are 0’s. 

The subgroup K of G is cyclic with generator “ *), Hence we have a homomorphism o of K 
into € mapping the generator into the primitive pth root of unity ¢ = e*”’?. This defines the 
representation p = of of G. By (110), 


a Lye as a al 
O B = (9 
F a't/\l 1/\ 0 l 


where the exponent is an integer. If a # +1, then g? + ¢. Hence the representation of K such 
that 


is inequivalent to ø. It follows from Corollary 2 to Theorem 5.20 (p. 304) that p = of is an 
irreducible representation of G. If y is its character, then the degree of p is y(1) = (p — 1)/2. Let 
x' be the remaining irreducible character. The relation 


(p~1)/2 


pip-/2=|Gl= Y x(t +z(1 +71) 
1 


= (p— l )/2+(p- | )*/4+ 7'1) 


gives y'(1) = (p — 1)/2. Hence the last two rows of the character table have the form 


x | (p-1)/2 | € d | 0 
epee jf’ 
= C j d | () 


7 | (p—1)/2 | 


The orthogonality relations (60) for columns 1 and 2 give (p — 1 )/2 + c(p—1)/2 + c'(p—1)/2 = 
0, so c + c'=-—1. Similarly, d + d'= -—1. The orthogonality of y with y, gives (p — 1)/2 + c(p — 
1)/2 + d(p — 1)/2 = 0, so c + d = —1. These relations imply that c' = d, d' = c. Hence the last 
two rows have the form 


X | (p—1)/2 | C Hise] 0 


— 


X | (p-1)2 |-1-e| ce | 0 
and it remains to determine c. We need to distinguish two cases: 
Case 1. (— 1/p) = 1, that is, — 1 is a quadratic residue mod p, or equivalently, p = 1 (mod4) 


(BAI, p. 133). In this case © ® is conjugate to (1 =G D ', Then c is real. Then the orthogonality 
relation (60) applied to the second and third columns gives (p — 1 )/2 — 2c(1 + c) = 0, which 


implies that ¢ = (=! £vpy2, 

Case 2. (—1/p) = —1, —1 is a non-square mod p, so p = 3 (mod4). In this case, “ ® is not 
conjugate to its inverse and hence c ¢ R. Then y' = y*, so **~'~*. Then the orthogonality 
relation (60) for the second and third columns gives (p — 1 )/2 + c? + (1 + c)? = 0, which 
implies that ¢=(~-! +v =p)2, 


These two cases complete the character tables, although we do not know which 


determination of the signs gives the character y of p = of. We can amalgamate the two cases by 
writing p* = (— 1 /p)p. Then in both cases we have 


(112) c= (—1 +,/p*)/2. 


We now apply the induced character formula (85) to the character y ofp. Since ð is a cross 
section of G relative to K, we obtain 


} 1 0 
(1 13) (— | + V p*)/2 = % ( i mn ge - e2tilp 


xeS 


where S is the set of positive integers less than p that are squares modulo p. 
We proceed to apply formula (113) to give a proof of the law of quadratic reciprocity of 
elementary number theory. We introduce the Gauss sum 


(114) j= x (Fe 


x \P 
where 1 <x < p-—1. Since !+=.s*=° (by the factorization #"~! = Me-t), we obtain from (113) 
the formula 
(115) g* = pt. 


Now let q be an odd prime 4p. We note that (p*/q) = p*⁄ ~ D? (modg) (BAI, p. 129, 
exercise 13). We now work in the ring R of integers of the cyclotomic field Q(¢) and use 
congruences modulo qR (BAI, pp. 278-281). We have 


p* 
(116) (Z) = g% ' (mod qR). 
q 


Since for "ER. Œu) = Eu" (mod qR), we have 


Hence 


K` q\ 
k Je =gi= e) (mod qR). 
q Pp 


Then (p*/q)g* = (q/p)g? and (p*/q)p* = (q/p)p*. Since the coset of p* = (— 1/p)p is a unit in 
R/qR, this implies that (p*/q) = (q/p). Since the coset of 2 is a unit, it follows that (p*/q) = 


(q/p). Hence 
q\_ p" "eL Ae I2) 
EA q )-(— 
7" 
uF q 


\ \ 


= (— 1) W2 naf? 
y 


which is the law of quadratic reciprocity whose first proof was published by Gauss in 1801. 


Jam indebted to D. R. Corro for pointing out this example of a Frobenius group with its 
application to the reciprocity law. The derivation following the formula g* = p* is classical 
(apparently first given by Jacobi in 1827). See Sechs Beweise des Fundamentaltheorems uber 
quadratische Reste von Carl Friederich Gauss in W. Ostwald’s “Klassiker der exacten 
Wissenschaften,” pp. 107—109. 


EXERCISES 


1. Show that if G is a Frobenius group with complement H, then the Frobenius kernel is the 
only normal subgroup of G satisfying G = KH, KN H= 1. 


2. Show that if G is a dihedral group of order 2m, m odd, generated by a and b such that a” 
= 1, b? = 1, bab-'! =a" !, then Gis a Frobenius group. 

3. A subset S of a group Gis called a T.I. set (trivial intersection set) if S # Ø and for any g 
e G, either ES = S or ES N S c {1}. Show that if S is T.I., then N¢(S) = {g € GIS? = S} 
is a subgroup containing S. Show that if G is a Frobenius group with Frobenius 
complement H, then H is a T.I. set. 

4. (Brauer-Suzuki.) Let S be a T.I. set in G and let N = N-(S). Suppose that g and y are 
complex class functions on N such that g and wy are 0 outside of S and g(1) = 0. Show 
that (i) pg) = 9(g) for any g + 1 in S, and (ii) (Ply) = (PW). 

5. (Brauer-Suzuki.) Let S, G, and N be as in exercise 4 and let y4, ..., Wm be complex 
irreducible characters of N such that y,(l) = ... = w,,(1). Show that there exists e = +1 
and complex irreducible characters %4, ..., Xm of G such that #4 =" —¥)"), 


6. Let G be a Frobenius group with Frobenius complement H and Frobenius kernel K 
Assume K is abelian. Use exercise 8, p. 63 of BAI to show that if p is an irreducible 
representation of G. then either (1) K c ker p or (ii) p is induced from a degree one 
representation of K. Conclude that G is an M-group if and only if H is an M-group. (This 
result holds without the hypothesis that K is abelian.) 


Sketch of proof: Suppose that (1) fails. Then, by Clifford’s theorem, there exists an 
irreducible constituent À of px such that À is not the unit representation 2, of K. We claim 


that To(4) = K. Otherwise, by Corollary 2, p. 304, there exists an **©@—* = Use*# such that 


*, = 4. But H acts fixed-point-freely on K; so does £H for any g (since K= 6G:). Thus x 
induces a fixed-point-free automorphism of K, so by the exercise cited 


K = {xkx tk '|ke K}. 
Then 
Alxkx 1k!) = à(xkx Ak") 
= “A(k)A(k)' 
= A(k)A(k)~* 
=|. 


Hence K c ker À, contrary to A + 44. Thus T¢(4) = K and so (11) follows from Corollary 
4 on p. 305. 


7. Let F be a finite field with q = p” elements, p a prime. Define the following subgroups 
of GL (F): 


a" Q 
r,=4( | aeF*,beF} 


fan ()\ | 
D ={(" Jacke 


| 


Show that if D, # 1, then T, is a Frobenius group with Frobenius complement D,, and 


Frobenius kernel K. 
(Note: These groups have been used by W. Feit to obtain an estimate for the number of 


solutions in F” of an equation of the form Lie" =%«eF*, See Feit, p. 140, in the 
references below.) 
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Elements of Homological Algebra with Applications 


Homological algebra has become an extensive area of algebra since its introduction in the mid- 
1940’s. Its first aspect, the cohomology and homology groups of a group, was an outgrowth of 
a problem in topology that arose from an observation by Witold Hurewicz that the homology 
groups of a path-connected space whose higher homotopy groups are trivial are determined by 
the fundamental group z,. This presented the problem of providing a mechanism for this 


dependence. Solutions of this problem were given independently and more or less 
simultaneously by a number of topologists: Hopf, Eilenberg and MacLane, Freudenthal, and 
Eckmann [see Cartan and Eilenberg (1956), p. 137, and MacLane (1963), p. 185, listed in 
References]. All of these solutions involved homology or cohomology groups of z,. The next 


step was to define the homology and cohomology groups of an arbitrary group and to study 
them for their own sake. Definitions of the cohomology groups with coefficients in an arbitrary 
module were given by Eilenberg and MacLane in 1947. At the same time, G. Hochschild 
introduced cohomology groups for associative algebras. The cohomology theory of Lie 
algebras, which is a purely algebraic theory corresponding to the cohomology theory of Lie 
groups, was developed by J. L. Koszul and by Chevalley and Eilenberg. These disparate 
theories were pulled together by Cartan and Eilenberg (1956) in a cohesive synthesis based on 
a concept of derived functors from the category of modules over a ring to the category of 
abelian groups. The derived functors that are needed for the cohomology and homology 
theories are the functors Ext and Tor, which are the derived functors of the hom and tensor 
functors respectively. 

Whereas the development of homological algebra proper dates from the period of the 
Second World War. several important precursors of the theory appeared earlier. The earliest 
was perhaps Hilbert’s syzygy theorem (1890) in invariant theory, which concerned free 
resolutions of modules for the ring of polynomials in m indeterminates with coefficients in a 
field. The second cohomology group H(G. €*) with coefficients in the multiplicative group C* 
of non-zero complex numbers appeared in Schur’s work on projective representations of 
groups (1904). More general second cohomology groups occurred as factor sets in Schreier’s 


extension theory of groups (1926) and in Emmy Noether’s construction of crossed product 
algebras (1929). The third cohomology group appeared first in a paper by O. Teichmiiller 
(1940). 

In this chapter we shall proceed first as quickly as possible to the basic definition and 
results on derived functors. These will be specialized to the most important instances: Ext and 
Tor. In the second half of the chapter we shall consider some classical instances of homology 
theory: cohomology of groups, cohomology of algebras with applications to a radical splitting 
theorem for finite dimensional algebras due to Wedderburn, homological dimension of modules 
and rings, and the Hilbert syzygy theorem. Later (sections 8.4 and 8.5), we shall present 
another application of homology theory to the Brauer group and crossed products. 


6.1 ADDITIVE AND ABELIAN CATEGORIES 


A substantial part of the theory of modules can be extended to a class of categories called 
abelian. In particular, homological algebra can be developed for abelian categories. Although 
we Shall stick to modules in our treatment, we will find it convenient to have at hand the 
definitions and simplest properties of abelian categories. We shall therefore consider these in 
this section. 
We recall that an object 0 of a category C is a zero object if for any object A of C, hom,(A, 
0) and hom,(0, A) are singletons. If 0 and 0’ are zero objects, then there exists a unique 
isomorphism 0 — 0 (exercise 3, p. 36). If A, B € obC, we define 0, g as the morphism 0,0 49 
where 0 49 is the unique element of hom,(4,0) and Oppg is the unique element of hom,(0, B). It is 
easily seen that this morphism is independent of the choice of the zero object. We call 04 g the 
zero morphism from A to B. We shall usually drop the subscripts in indicating this element. 
We can now give the following 
DEFINITION 6.1. A category C is called additive if it satisfies the following conditions: 
AC1.C has a zero object. 
AC2. For every pair of objects (A,B) in C, a binary composition + is defined on the set 
hom((4,B) such that (hom((A, B), + ,04 g) is an abelian group. 
AC3. If A,B,C € ob Cf fifa € homc(4 B), and g, g), g> € homc(B, C), then 


(gi +92) f =g f+9f 
Ili +A) = gfi +92 
AC4. For any finite set of objects {A),..., A,} there exists an object A and morphisms 


Pj : A> A, i; : A; > A, 1 <j <n, such that 


pil; = Ly Pi; = 9 fj #k 


We remark that AC2 means that we are given, as part of the definition, an abelian group 
structure on every hom((4 , B) whose zero element is the categorically defined 0, g. AC3 
states that the product fg, when defined in the category, is bi-additive. A consequence of this is 
that for any A, (hom¢(4, A), +, -, 0, 1 = 1,) is a ring. We note also that AC4 implies that (A, 
{p;}) is a product in C of the 4, 1 <j < n. For, suppose B € obC and we are given 
fB => Ay 1 Sj <n. Put f= di; fe home(B, A) Then p,f = f, by (1) and if pif =f, for 1 < k <n, then 
(1) implies that / = }i,/, =f- Hence fis the only morphism from B to A such that p,f = fọ, 1 < k 
<n, and (A, {p;}) is a product of the Aj. In a similar manner we see that (A, {i;}) is a coproduct 
of the 4;. 


It is not difficult to show that we can replace AC4 by either 
AC4’. C is a category with a product (that is, products exist for arbitrary finite sets of 
objects of C), or 
AC4". C is a category with a coproduct. 


We have seen that AC4 = AC4’ and AC4” and we shall indicate in the exercises that AC1—3 
and AC4' or AC4" imply ACR. The advantage of AC4 is that it is self-dual. It follows that the 
set of conditions defining an additive category is self-dual and hence if C is an additive 
category, then C% is an additive category. This is one of the important advantages in dealing 
with additive categories. 

If R is a ring, the categories R-mod and mod-R are additive. As in these special cases, in 
considering functors between additive categories, it is natural to assume that these are additive 
in the sense that for every pair of objects A,B, the map F of hom(4,B) into hom(FA, FB) is a 
group homomorphism. In this case, the proof given for modules (p. 98) shows that F preserves 
finite products (coproducts). 

We define next some concepts that are needed to define abelian categories. Let C be a 
category with a zero (object), f: A —>B in C. Then we call k : K — Aa kernel of f if (1) k is 
monic, (2) fk = 0, and (3) for any g : G— A such that fg = 0 there exists a g’ such that g = kg’. 
Since k is monic, it is clear that g'is unique. Condition (2) is that 


A f 


(1) 


B 


K 


is commutative and (3) is that if the triangle in 


is commutative, then this can be completed by g': G — K to obtain a commutative diagram. It 
is clear that if k and k’ are kernels of f, then there exists a unique isomorphism u such that k’ = 
ku. 

In a dual manner we define a cokernel of f as a morphism c : B — C such that (1) c is epic, 
(2) cf = 0, and (3) for any A :B — H such that hf = 0 there exists h'such that A = h'e. 


Iff: A — B in R-mod, let K = ker f in the usual sense and let k be the injection of K in A. 
Then k is monic, fk = 0, and if g is a homomorphism of G into A such that fg = 0, then gG c K. 
Hence if we let g’ be the map obtained from g by restricting the codomain to K, then g = kg’. 
Hence k is a kernel of f. Next let C = B/fA and let c be the canonical homomorphism of B onto 
C. Then c is epic, cf = 0, and if h : B — H satisfies Af = 0, then fA c ker h. Hence we have a 
unique homomorphism h': C = B/fA — H such that 


B . C = B/fA 


H 


is commutative. Thus C is a cokernel of f in the category R-mod. 
We can now give the definition of an abelian category 


DEFINITION 6.2. A category C is abelian if it is an additive category having the following 
additional properties: 

ACS. Every morphism in C has a kernel and a cokernel. 

AC6. Every monic is a kernel of its cokernel and every epic is a cokernel of its kernel. 

AC7. Every morphism can be factored as f= me where e is epic and m is monic. 

We have seen that if R is a ring, then the categories R-mod and mod-R are additive 

categories satisfying AC5. We leave it to the reader to show that AC6 and AC7 also hold for 
R-mod and mod-R. Thus these are abelian categories. 


EXERCISES 


1. Let C be a category with a zero. Show that for any object A in C, (A, 14,0) is a product 
and coproduct of A and 0. 

2. Let C be a category, (A, Pı, p>) be a product of A, and A, in C, (B, q1, q2) a product of B, 
and B, in C, and let h; : B; —> A;. Show that there exists a unique f: B — A such that hq; 
= pj. In particular, if C has a zero and we take (B, q1, q2) = (A), 14,, 0) then this gives a 
unique 7, : A; — A such that p;i; = 14), poi; = 0. Similarly, show that we have a unique i, 
: A, — A such that pi» = 0, Poin = 1 yo. Show that (iip; + inp2)i, = 1, and (ipi + inp)i> = 
i>. Hence concluded that ip; + ipp) = 14. Use this to prove that the conditions AC1 — 
AC3 and AC4’ < AC4. Dualize to prove that AC] — AC3 and AC4" < AC4. 

3. Show that if A and B are objects of an additive category, then 0, g = 09 304 9, where 04 
o is the zero element of hom(4,0), 0o, g is the zero element of hom(0, B), and 04, g is the 
zero element of the abelian group hom(A, B). 

4. Let (4: M A2.pı-P3) be a product of the objects and A; and A, in the category C. Iff : B — A, 
denote the unique f: B — A, I A, such that pf = f; by fi II fy. Similarly, if (4; U 4, i4, 
i2) is a coproduct and g, : 4; > C, write gı U g, for the unique g : A, U A, —> C such that 
gi; = gj. Note that if C is additive with the i; and p; as in AC4, then f IT fy = ifi + inf 
and g} U g = gp; + g&2p2. Hence show that if i}, i>, p}, p> are as in AC4, so A= A, II A, 
= A, U A>, then 


(9, Ug) (A Mf) =g fi tgh 
(from B —>C). Specialize A, = A, = 0, g1 = 2 = 1, to obtain the formula 
ith, = (Ie Ue) (fi TA) 
for the addition in hom, (8, C). 


5. Use the result of exercise 4 to show that if F is a functor between additive categories that 
preserves products and coproducts, then F is additive. 


5.2 COMPLEXES AND HOMOLOGY 


The basic concepts of homological algebra are those of a complex and homomorphisms of 
complexes that we shall now define. 


DEFINITION 6.3. If R is a ring, a complex (C, d) for R is an indexed set C = {Ci} of R- 


modules indexed by Z together with an indexed set d= {dji € 2} of R-homomorphisms d; : C; 
— C; _ , such that d; _ ıd; = 9 for all i. If (C, d) and (C’, d') are R-complexes, a (chain) 
homomorphism of C into C' is an indexed set * = \%\i€2} of homomorphisms a; :C; > Cj’ such 
that we have the commutativity of 


Ci 4, Cia 
(2) a, Q1 
C, d; fal 


for every i. More briefly we write ad = d'.a 


These definitions lead to the introduction of a category R-comp of complexes for the ring R. 
Its objects are the R-complexes (C, d), and for every pair of R-complexes (C, d), (C', d"), the 
set hom(C, C’) is the set of chain homomorphisms of (C,d) into (C', da’). It is clear that these 
constitute a category, and as we proceed to show, the main features of R-mod carry over to R- 
comp. We note first that hom(C, C") has a natural structure of abelian group. This is obtained 
by defining a + p for a, p e hom(C, C’) by (a + p); = a; + Pi The commutativity a;_ id; = d'a,, 
Pid; = d' Pi gives (a;_, + Pi- )d; = d'(a; t p:i), so a+ p e hom(C, C’). Since hom,(C,, C’;) is 
an abelian group, it follows that hom(C, C’) is an abelian group. It is clear also by referring to 
the module situation that we have the distributive laws y(a + p) = ya + yf, (a + B)d = aô + pô 
when these products of chain homomorphisms are defined. If (C, d) and (C’, d”) are complexes, 
we can define their direct sum (C ® C', d ® d'^) by (C + C'i = C,® C/, d;® d/ defined 


component-wise from ©! OCC 1 @ Cia as (xax) >(dxsdix). Tt is clear that 


(di-ı ® di- 1)(d: Bd) = 0, so (C C',d@d') is indeed a complex. This has an immediate extension 
to direct sums of more than two complexes. Since everything can be reduced to the module 
situation, it is quite clear that if we endow the hom sets with the abelian group structure we 
defined, then the category R-comp becomes an abelian category. 

The interesting examples of complexes will be encountered in section 4. However, it may 
be helpful to list some at this point, although most of these will appear to be rather special. 


EXAMPLES 


1. Any module M becomes a complex in which C, =M, i € Z, and d;=0:C,;— Cii 


2. A module with differentiation is an R-module equipped with a module endomorphism 6 
such that & = 0. If (M,ô) is a module with differentiation, we obtain a complex (C, d) in which 
C,=0 fori < 0, (= C = C = C3 =M, C = 0 for j > 3, d, =d; =ò, and d= 0 ifi # 2, 3. 


3. Let (M, ô) be a module with a differentiation that is Z-graded in the following sense: 
M = ®«-M, where the M; are submodules and ô(M,) = M; _ ; for every i. Put C; = M, and d; = 


ôļlM,. Then C= {C}, d= {d,} constitute an R-complex. 


4. Any short exact sequence 0+M’+M+M’-+0 defines a complex in which 
C,=0, i <0, C, =M”, C, = M, C= M'. C)=0 if j>3, d, =B, dy = 9, dj=Ojf7 423, 


We shall now define for each i € Z a functor, the ith homology functor, from the category of 
R-complexes to the category of R-modules. Let (C, d) be a complex and let Z(C) = ker d; so 


ZC) is a submodule of C;. The elements of z; are called i-cycles. Since d;d; , ; = 0, it is clear 
that the image d; , C; + ; is a submodule of Z;. We denote this as B; = B({C) and call its 
elements i-boundaries. The module H; = H({C) = Z,/B; is called the ith homology module of 
the complex (C, d). Evidently, C+:4¢;C.-1 is exact if and only if HC) = 0 and hence the 
infinite sequence of homomorphisms 


RE tO Ea 


is exact if and only if HC) = 0 for all i. 

Now let a be a chain homomorphism of (C, d) into the complex (C’,d’). The commutativity 
condition on (2) implies that *4:¢ Zi = ZAC) and ™(8)< B; = BIC), Hence the map =~ %z+Bh 362, 
is a homomorphism of Z; into H"; = H(C’) = Z'/B'; sending B; into 0. This gives the 
homomorphism & of H,(C) into H(C’) such that 


(3) z;+B; > 02; +B;. 


It is trivial to check that the maps (C, d) ~ HC), hom(C, C) —> hom(H,(C), H,(C’)), where the 
latter is a ~ %, define a functor from R-comp to R-mod. We call this the ith homology functor 
from R-comp to R-mod. It is clear that the map a ~ % is a homomorphism of abelian groups. 
Thus the ith homology functor is additive. 

In the situations we shall encounter in the sequel, the complexes that occur will have either 
C;= 0 for i < 0 or C;=0 for i > 0. In the first case, the complexes are called positive or chain 
complexes and in the second, negative or cochain complexes. In the latter case, it is usual to 
denote C_; by C’ and d _ ; by d. With this notation, a cochain complex has the appearance 


do 


1! d? 
TEET hE pi LAN je 


if we drop the C~, i > 1. It is usual in this situation to denote ker d! by Z'and d’ ~ !C'~ ! by B’. 
The elements of these groups are respectively i-cocycles and i-coboundaries and H! = Z'/B' is 


the ith cohomology group. In the case of if H?, we have H? = Z°. A chain complex has the form 
0-Co#C,#C,+---.. In this case Hy = Co/d,C, = coker d} 


EXERCISES 


1. Let a be a homomorphism of the complex (C, d) into the complex (C’, d^). Define 
C= C-C, ie2 and if *-16Ci-1 ie Ci,, define #%i- 1.8) = (dh iXi- 1%- 1X + xD Verify that (C", d 
") is a complex. 

2. Let (C, d) be a positive complex over a field F such that Xdim C, < œ (equivalently 
every C; is finite dimensional and C, = 0 for n sufficiently large). Let r; = dim C; p; = 
dim HC). Show that X(-1)/p, = X(-1)’r;. 

3. (Amitsur complex.) Let S be a commutative algebra over a commutative ring K. Put 
S = K, S*=S®--@S, n factors, where ® means ®g. Note that for any n we have n + 1 algebra 
isomorphisms #.1 <i< «+1, of S” into S”*! such that 


X1@*"'@x, > X, O°" @X;_ OION AX 


For any ring R let U(R) denote the multiplicative group of units of R. Then US.) = Uis,..), 
Define @:U(S,) + US..." -1, by 


n+l 


d'u = |] (0u 
i=i 


(e.g., d'u = (8u (8u) (8 uy’). Note that if i > j, then 8#! = 6/8! and use this to show 
that d” * 1d” = 0, the map u ~ 1. Hence conclude that {U(S"), d”\n > 0} is a cochain 
complex. 


5.3 LONG EXACT HOMOLOGY SEQUENCE 


In this section we shall develop one of the most important tools of homological algebra: the 
long exact homology sequence arising from a short exact sequence of complexes. By a short 
exact sequence of complexes we mean a sequence of complexes and chain homomorphisms 
ccc” such that 0+ci5¢,4c;+0 is exact for every i e Z, that is, a; is injective, 8; is 
surjective, and ker J; = im a;. We shall indicate this by saying that 0 > C' —> C— C'— 0 is 
exact. We have the commutative diagram 


in which the rows are exact. The result we wish to prove is 


THEOREM 6.1. Let 0+c+c+c"’ +0 be an exact sequence of complexes. Then for each i € 
Z we can define a module homomorphism 4: 4éC")4,-\\C) so that the infinite sequence of 
homology modules 


(4) ++ HC) Hi(C) 4 HC") > H,-1(C) 9 HC) 
is exact. 


Proof. First, we must define A;. Let z"; e Z(C"), so d"z", = 0. Since J; is surjective, there 
exists a c; € C; such that f,c;=z";. Then f;_ d;c; = d" pici = d",z"; = 0. Since ker #;_ ; = ima; _ 
; and a;_ 1s injective, there exists a unique z’,_ , E€ C’;_ ; such that a; _ ,z';_ | = d;c; Then a; _ 
adi- 125-1 = 4; 
Z; _\(C’). Our determination of z’;_ , can be displayed in the formula 


(5) Zi- 1 Eai (dip; '(2/')) 


where £~ |) and a |, ,( )denote inverse images. We had to make a choice of c; € £7 '(z") at 
the first stage. Suppose we make a different one, say, %€Ar (f). Then Pë =e: implies that c 
asata Geci, Then %-ri-1+dic)) = dei tdci = dde taci) = di, Thus the replacement of c; by & 
replaces z} _ 1 by z^ 1 + d'c';. Hence the coset z'_; + B (C^ in H; (C^ is independent of 


i- 


1 Qi 1245 1=d; idic; = 0. Since a; _ 7 is injective, d';_ ,z';,_ ,; =Osoz';_,€ 


the choice of c; and so we have a map 

i mE ' 
(6) 2, 2-1 + B,- (C) 
of ZC") into H; _ ,(C’). It is clear from (5) that this is a module homomorphism. Now suppose 
z"; e B(C"), say, * = Eac uti ECGs, Then we can choose cay € Cay so that B; 4 1641 = C" 
and then Bidis iss = die sBis ities = dis sehen = zi, Hence disti Br (2) and since didisı =9, we have z’ _ | 
= 0 in (5). Thus B(C”) is in the kernel of the homomorphism (6) and 


(7) AEE +B(C") > 2)_, + B,-\(C) 


is a homomorphism of H(C”)into H;_ (C. 

We claim that this definition of A; makes (4) exact, which means that we have 
l: im à, = ker Ë, I1: im Și; = ker A, and HI: im A, = ker, 4, 

I. It is clear that /#a=9 so s©4(0) Suppose z; e ZC) and 
Bazi+ BAC) = 0. so fz, = d?o cjeni EC, There exists a cj}; € C;4 1 such that B; + 1¢;4, = C44 
and so it~ dir ities) = diers die ifisi = 0. Then there exists zhe C’; such that a,2'; = z;-d; + 1C; + 
1. Then a d'z = dažs dz- ddis = 0. Hence d'z = 0 and z} € ZC’). Now 
ulzi + BAC) = 242) + BAC) = zi— dis eis 1+ BAC) = 2+ BMC), Thus z; + B(C) € im % and hence ker ħi c imā 
and hence ker ® = im &. 

Il. Let z; € Z(C) and let z" = fz; Then Afz",+B(C")) = 0 since z; € B; \(z") and dz; = 0, 
so a;_ 0 =d,Z;. Thus AË (z; + B(C)) = 0 and im’ c ker A;. Now suppose z"; € Z{C") satisfies 
A(z", + B{C")) = 0. This means that if we choose c; € C; so that fic; =z"; and z';_ ; €C';_ ; so 
that a, 47; _ 1 = de, then z _ , = d'c} for some c' e C',;. Then 
lzi + BAC’))= zi + BAC) = zi— dis pein. + BAC) = 24+ BC) and dic; = ajc’) = 0. Also Ble wci) = Bee; = z, 
Hence, if we put z; = c; — a,c’, then we shall have z; €Z,(C) and Bie + BAC) = B+ BAC) = z} + BKC), 
Thus ker A; c im and hence ker A; = im By, 


M. Ifz';_ | €Z;_ , andz';_ ; + B;_ (C) € imA,, then we have az"; e Z(C") anda ce C 


such that fic; = z"; and aZ; = dc; Then %-1@i-1+Bi-(C) = -17-1 +B-1(C)=0, Hence im 
A,cker %_,. Conversely, let z^; + Bi-(C)eker %-1, Then %-13-1 = dcn 6C, Put z” = pici Then d"z 
" = Haide eC = 0, so z” € ZC"). The definition of A; shows that 
Ai(zi + BC") = zi-1+B:-1(C), Thus ker %_; c im A; and hence ker %_ , = im A; O 


The homomorphism A; that we constructed is called the connecting homomorphism of 
HC") into H; _ ;(C’) and (4) is the Jong exact homology sequence determined by the short 


exact sequence of complexes 0+c+cC4c"+0, An important feature of the connecting 
homomorphism is its naturality, which we state as 


THEOREM 6.2. Suppose we have a diagram of homomorphisms of complexes 


C a C B G 


which is commutative and has exact rows. Then 


H,(C") z H,_,(C’) 
(9) h' fi- 


H,(D") A H;_,(D') 


is commutative. 


By the commutativity of (8) we mean of course the commutativity of 
c a S Bı G 
D; a D, 5, Dy 
for every i. The proof of the commutativity of (9) is straightforward and is left to the reader. 


EXERCISE 


1. (The snake lemma.) Let 


be a commutative diagram of module homomorphisms with exact columns and middle 
two rows exact. Let x" e K" and let y e M satisfy uy=f"x". Then wy = g"uy =9'f"*" = 0 and 
there exists a unique z’ e N’ such that v'z' = gy. Define Ax” = h’z’. Show that Ax” is 
independent of the choice of y and that A : K” — C’is a module homomorphism. Verify 
that 


n A 


KSKSK"5C0404C 


is exact. Show that if w’1s a monomorphism, then so is K' and if v is an epimorphism, 
then so is y. 


6.4 HOMOTOPY 


We have seen that a chain homomorphism a of a complex (C,d) into a complex (C’,d’) 
determines a homomorphism & of the ith homology module Hi(C) into H(C’) for every i e Z. 


There is an important relation between chain homomorphisms of (C,d) to (C'd^ that 
automatically guarantees that the corresponding homomorphisms of the homology modules are 
identical. This is defined in 


DEFINITION 6.4. Let a and p be chain homomorphisms of a complex (C, d) into a 
complex (C', d'). Then a is said to be homotopic to P if there exists an indexed set s = {s;\ of 


module homomorphisms s;: C; > C'; 1, i € Z, such that 
4, — h: = di, 15; +5;- idi. 
We indicate homotopy by a ~ P. 
Ifa ~ B, then & = 4 for the corresponding homomorphisms of the homology modules AC) 
— H(C’). For, ifz; € ZC), then %22:+B; > %2,+B; and By:2;+ 8; > Bi2i+8;, Hence 
a,(2,+B;) = aizi+ Bi = (B;+d;, .8;+58;- ,d;)z;+B; 
= (8,2; +d}, 15,2;) +B; = pizi +B; = f(z, + B)). 


It is clear that homotopy is a symmetric and reflexive relation for chain homomorphisms. It 
is also transitive, since if a ~ p is given by s and p ~ y is given by ź, then 


a —B; = dj 418; +5;- 14; 
Bi-¥i = dia titty}. 


Hence 


a, = di 4 (Sj tti) + (5,1 tti- 1); 


Thus s + t= {s; + t;} is a homotopy between a and y. 


Homotopies can also be multiplied. Suppose that a ~ p for the chain homomorphisms of (C, 
d) — (C’, d') and y ~ ô for the chain homomorphisms (C’, d) — (C", d"). Then ya ~ df. We 
have, say, 


as = d; 18; +58;-14;, 
—O; = di, st; +t; 1d}. 
Multiplication of the first of these by y; on the left and the second by J; on the right gives 
Yii — Vipi = Vidi s 8+ ViS:— 14; = i Via Si + Yii- 14; 
A OB; = dis 0B +t iB; = dis 0B +t; By 1dj 
(by (2)). Hence 
0) — OB; = disisit tibi) + OiSi-1 +6: 1B) 1): 


Thus y a ~ ô pP via u = {u;} where u;= yi + ls; + tP; 


5.5 RESOLUTIONS 


In the next section we shall achieve the first main objective of this chapter : the definition of 
the derived functor of an additive functor from the category of modules of a ring to the category 
of abelian groups. The definition is based on the concept of resolution of a module that we now 
consider. 


DEFINITION 6.5. Let M be an R-module. We define a complex over M as a positive 
complex C = (C, d) together with a homomorphism € : Cy — M, called an augmentation, such 


that ed, = 0. Thus we have the sequence of homomorphisms 


(10) eC ee, our CO Mad 


where the product of any two successive homomorphisms is 0. The complex C, € over M is 

called a resolution of M if (10) is exact. This is equivalent to H{C) =0 for i > 0 and H)(C) = 

Co/d,C, = Co/ker £ = M. A complex Ce over M is called projective if every C; is projective. 
We have the following important 


THEOREM 6.3. Let C, e be a projective complex over the module M and let C, &' be a 


resolution of the module M', u a homomorphism of M into M'. Then there exists a chain 
homomorphism a of the complex C into C' such that ue = €'a9. Moreover, any two such 


homomorphisms a and p are homotopic. 


Proof. The first assertion amounts to saying that there exist module homomorphisms a;, i > 0, 
such that 


C Ciz Cy» 


/ , / 
Ca 1 C,-2 


is commutative. Since Cy is projective and Co M'—0 is exact, the homomorphism ye of Co 
into M’ can be “lifted” to a homomorphism aq : Co —> C'h so that ue = c'ag. Now suppose we 
have already determined a, ..., a,_; so that the commutativity of (11) holds from Cp to C, _ 1. 
We have 4n~1%, = 145 = %-24n-14,=9 Hence a, _ 1d C, C ker d',_ , =imd', =d',C',. We can 
replace C’, , by d',C’, for which we have the exactness of C’, — d',C', — 0. By the 
projectivity of C, we have a hommomorphism a, : C, — C', such that d', a, = a,_\d,. This 
inductive step proves the existence of a. Now let a and p satisfy the conditions. Let y = a — £. 
Then we have 


EA )—& Bo = pe— ue = 0, 
=7,-14 moi. 


/ 
VAJ 
€ Yo 


d 


' 
^) 
nin 


We have the diagram 


Since £' yo = 0, yoCo C d'1C' We have the diagram 


Co 
Yo 


ar 


with exact row. As before, there exists a homomorphism sọ : Cy — C^ such that yọ = d'\59. 
Suppose we have determined spo, ..., s,,_, such that s; : C; > C';, ; and 

Yi = di 41S: +5;- 14), 0<i<n-—l. 
Consider Vn Sn14p- We have ayn —~ Sn 1da) = Yna- 14, — diy Sp- id, = (Ya- 1 — G5, id, = Sh- 20n- id, ni 0., It 
follows as before that there exists a homomorphisms, : C, — C, + 1 such that d’, + 18, = Yn — Sh 
_ ,d,. The sequence of homomorphisms so, $1, ... defines a homotopy of a to p. This completes 
the proof. D 


The existence of a projective resolution of a module is easily established. In fact, as we 
shall now show, there exists a resolution (10) of M that is free in the sense that the modules C; 


are free. First, we represent M as a homomorphic image of a free module Cp, which means that 
we have an exact sequence ker € + Cy 5 M — 0 where Cp is free and i is the injection of ker e. 
Next we obtain a free module C} and an epimorphism z of C} onto ker e. If we put d, = ia: C} 
— Cy, we have the exact sequence C, “+ Cy M — 0. Iteration of this procedure leads to the 
existence of an exact sequence 


dn d E 
i e Oe W > C> M>0 
where the C; are free. Then (C, d) and e constitute a free resolution for M. 


All of this can be dualized. We define a complex under M to be a pair D, n where D is a 
cochain complex and y is a homomorphism M — D? such that d?y = 0. Such a complex under 
M is called a coresolution of M if 


(12) 0+-M5D°S5p'5pD?+--. 


is exact. We have shown in section 3.11 (p. 159) that any module M can be embedded in an 
injective module, that is, there exists an exact sequence 0 > M 4 D? where D? is injective. 
This extends to 0 — M — D? = coker 7 where coker y = D°/nM and ~ is the canonical 
homomorphism onto the quotient module. Next we have a monomorphism 7, of coker 7 into an 
injective module D! and hence we have the exact sequence 0 > M 4 D? $ D! where d? = nyz. 


Continuing in this way, we obtain a coresolution (12) that is injective in the sense that every D 
is injective. The main theorem on resolutions can be dualized as follows. 


THEOREM 6.4. Let (D, 1) be an injective complex under M, (D', n') a coresolution of M', 
A a homomorphism of M' into M. Then there exists a homomorphism g of the complex D' into 


the complex D such that nd = g°n'. Moreover, any two such homomorphisms are homotopic. 


The diagram for the first statement is 


0 
M L] D? P D’ 


The proof of this theorem can be obtained by dualizing the argument used to prove Theorem 
6.3. We leave the details to the reader. 


6.6 DERIVED FUNCTORS 


We are now ready to define the derived functors of an additive functor F from a category R- 
mod to the category Ab. Let M be an R-module and let 


dı 


MME Cpi: 


be a projective resolution of M. Applying the functor F we obtain a sequence of 
homomorphisms of abelian groups 
F(:) F(d;) F(d>) 


(13) 0 FM — FC, + FC, aim 


Since F(0) = 0 for a zero homomorphism of a module into a second one and since F is 
multiplicative, the product of successive homomorphisms in (13) is 0 and so FC = {FC;}, F(d) 


= {F(d;)} with the augmentation Fe is a (positive) complex over FM. If F is exact, that is, 
preserves exactness, then (13) is exact and the homology groups H(/C) = 0 for i > 1. This 


need not be the case if F is not exact, and these homology groups in a sense measure the 
departure of F from exactness. At any rate, we now put 


(14) L,FM =H (FC) n>0. 
This definition gives 
(15) Ho(FC) = FC F(d;)FC, 


since we are taking the terms FC; = 0 ifi < 0. 

Let M' be a second R-module and suppose we have chosen a projective resolution 0 — M' 
— Cy“ C'i ... of M'. from which we obtain the abelian groups H,,(FC’), n > 0. Let u be a 
module homomorphism of M into M’. Then we have seen that we can determine a 
homomorphism a of the complex (C, d) into (C’, d’) such that we = e'ay. We call a a chain 
homomorphism over the given homomorphism u. Since F is an additive functor, we have the 
homomorphism F(a) of the complex FC into the complex FC’ such that F(u)F (e) = Fle) F(a). 


Thus we have the commutative diagram 


FM F(e) FCo F(d,) FC, 


Then we have the homomorphism F n) of H, (FC) into H,(FC’). This is independent of the 
choice of a. For, if f is a second homomorphism of C into C' over u, p is homotopic to a, so 
there exist homomorphisms s„ : C, > C'h + p A È 0, such that a, — Bn = d'n + 8, +S, —1 de 
Since F is additive, application of F to these relations gives 


F(a,)—F(,) = F(d,, )F(S,) + F(s,-1)F(d,). 


Thus F(a) ~ F(f) and hence Fn) = Fi). We now define L, F) = F). Thus a 
homomorphism u : M —> M' determines a homomorphism L,F (u) : H, (FC) —> H, (FC. We 
leave it to the reader to carry out the verification that L,F defined by 


(L,F)M = H (FC), MeobR-mod 
(LF )(u) = F(&,), we hom,(M, M’) 


is an additive functor from R-mod to Ab. This is called the nth left derived functor of the given 
functor F. 

We now observe that our definitions are essentially independent of the choice of the 
resolutions. Let C be a second projective resolution of M. Then taking u = 1 in the foregoing 
argument, we obtain a unique isomorphism 7, of H,(FC) onto H,(FC). Similarly, another 


choice C’ of projective resolution of M’ yields a unique isomorphism y’, of H,,(FC’) onto H,,(F 
C+) and L,F is replaced by 7',(L,F)n, ~ 1. 

From now on we shall assume that for every R-module M we have chosen a particular 
projective resolution and that this is used to determine the functors L„F. However, we reserve 
the right to switch from one such resolution to another when it is convenient to do so. 


We consider next a short exact sequence 0 — M'+ M 4, M" — 0 and we shall show that 
corresponding to such a sequence we have connecting homomorphisms 


A,:L,FM" > L,-,FM’, nèl, 


n-1 
such that 
LoF(p) LoF (x) Ay 


0 — LFM" -—— Ly) FM ——L, FM’ — L, FM" ~ 


is exact. For this purpose we require the existence of projective resolutions of short exact 
sequences of homomorphisms of modules. By a projective resolution of such a sequence 0 > 


M' — M— M" — 0 we mean projective resolutions C’, s’, C, e, C", e” of M', M, and M" 
respectively together with chain is homomorphisms i: C’— C, p : C — C" such that for each n, 
0-C!,- C, > C", = 0 is exact and 


Co i Co Po cy 
(16) e e” 
M' a M B M" 


is commmutative. 

We shall now prove the existence of a projective resolution for any short exact sequence of 
modules 0 — M' M 4 M" — 0. We begin with projective resolutions C’, e’, and C", e" of M’ 
and M” respectively: 


2? >C; 7 Cop M + () 
17) ChaCha Cha M90 


3 rT í FA 
We let C, = C'n 2G 27 1, 2,3, ..., X'h 
")- Then 0 > C', > C’, 8 C", > c, — 0 is exact and Ci is projective. We now define ex, 
= aE X'y + OX" o, d Xn = (dX + OX" n d,"X,") Where o : C"o > M', Op: C'a > C'a are to be 


non 


= (x'n, 0) for x', € C'i PnXn = xX", for x, =O yx 


determined so that C, € is a projective resolution of M which together with C’, s’ and C", € 
constitutes a projective resolution for the exact sequence 0 > M' ? M Ë M" — 0. 

We have cigx'o = &(x'9, 0) = as'x’y. Hence commutativity of the left-hand rectangle in (16) is 
automatic. Also e"pgxg = £"x"ọo and fexy = B(ae'x'g + 0x"9) = Box"y. Hence commutativity of 
the right-hand rectangle in (16) holds if and only if 


(18) g" = Ba 


We have edx] = e(d'x' + Ox" 1, d\"x,") = ac'0 x" FO "x". Hence ed} = 0 if and only if 
(19) ae, + od = 0. 
Similarly, the condition d,,_ ;d, = 0 is equivalent to 
! H 
(20) d,-,9, + 0, -1d, = 0. 


Now consider the diagram 


mM’ os Mm” — 0. 
Since C’p is projective there exists a a: C"0 — M' such that (18) holds. Next we consider 


Ci 


4 
/ 
/ w 
6, / —od; 
/ 


/ 
cor M E M” 
Since ¢'Cy'= M' the row is exact, and since C," is projective and fod', = e"d", = 0, there 
exists a 0}: Cı” — C'g such that (19) holds (see exercise 4, p. 100). Next we consider 


C; 


/ 
/ 
/ 
0, / | ~ 0,4; 
/ 
/ 


“ 
Qe Os M 


di e 


Here again the row is exact, C,” is projective and £'9;d", = 0 since ae’0;d," = — od" d", = 0 
and ker a = 0. Hence there exists 0>: C,” — C’, such that (19) holds for n = 2. Finally, the 
same argument establishes (20) for n > 2 using induction and the diagram 


Ca- 3 G-3 Go Gs 


da-i dy- 


It remains to show that ... C, 2 C, # Cy ? M — 0 is a resolution. For this purpose we regard 


this sequence of modules and homomorphisms as a complex € and similarly we regard the 
modules and homomorphisms in (17) as complexes C and C”. Then we have an exact sequence 
of complexes 0 > C' — C > C" — 0. Since H(C') = 0 = HC’) it follows from (4) that H (©) 
= (0. Then Č provides a resolution for M which together with the resolutions for M' and M" 
gives a projective resolution for 0 — M'— M— M" — 0. 

We can now prove 


THEOREM 6.5. Let F be an additive functor from R-mod to Ab. Then for any short exact 


sequence of R-modules 0 — M' + M 4&4 M" — 0 and any n = 1, 2, 3,... there exists a 
connecting homomorphism A,:L,,FM" L,_ ,FM' such that 


(21) O+——LoFM” = LFM LFM L, FM” ——-- 


is exact 


"n 


Proof. We construct a projective resolution C’, e! C, e, C", e", i, p for the given short exact 
sequence of modules. For each n > 0, we have the short exact sequence of projective modules 
0 — C', S C, %C'n = 0. Since C", is projective, this splits and consequently, 0 > FC', "S’ 
FC', “= FC", — 0 is split exact. Thus we have a short exact sequence of complexes 0 — 
F(C) ® F(C) ™® F(C") > 0. The theorem follows by applying the long exact homology 
sequence to this short exact sequence of complexes. O 


Everything we have done can be carried over to coresolutions, and this gives the definition 
and analogous results for right derived functors. We shall now sketch this, leaving the details to 
the reader. 

Again let F be a functor from R-mod to Ab. For a given R-module M, we choose an 
injective coresolution 0 > M4 D? & D! $ D? —> ..., Applying F, we obtain 0 > FM 2% 
FD’ #9) FD! “°.) FD? — ... and we obtain the complex FD = {FD'}, F(d) = {F(d')}. Then we 
put (R”F)M = H"(FD), n > 0. If M' is a second R-module, (D', 7’) a coresolution of M', then for 
any homomorphism 4:M'— M we obtain a homomorphism R" F(A): (R"F)M' — (R"F)M. This 
defines the right derived functor of the given functor F. The results we obtained for left 
derived functors carry over. In particular, we have an analogue of the long exact sequence 
given in Theorem 6.5. We omit the details. 


EXERCISES 


1. Show that if M is projective, then LFM = FM and L,FM = 0 for n > 0. 


2. Show that if F is right exact, then F and LoF are naturally equivalent. 


5.7 EXT 


In this section and the next we shall consider the most important instances of derived functors. 
We begin with the contravariant hom functor hom( —, N) defined by a fixed module N, but first 
we need to indicate the modifications in the foregoing procedure that are required in passing to 
additive contravariant functors from R-mod to Ab. Such a functor is a (covariant) functor from 
the opposite category R-mod*? to Ab, and since arrows are reversed in passing to the opposite 
category, the roles of injective and projective modules must be interchanged. Accordingly, to 
define the right derived functor of a contravariant functor G from R-mod to Ab, we begin with 


a projective resolution 0 — M — Cy = C, — ... of the given module M. This gives rise to the 
sequence 0 > GM “2 GC, ““¥ GC, —... and the cochain complex (GC,G(d)) where GC = 


{GC} and G(d) = (G(d,)}. We define (R"G)M = H"(GC). In particular, we have (R°G)M = ker 
(GCy — GC). For any u € homg(M', M) we obtain a homomorphism (R,,G) (u): (R"G)M > 


(R"G)M' and so we obtain the nth rigrto derived functor R"G of G, which is additive and 
contravariant. Corresponding to a short exact sequence 0 — M' > M — M" — 0 we have the 
long exact cohomology sequence 


(22) 0 > R°GM" > R°GM > R°GM' > R'GM" > R'GM > R'GM' >= 


where R’GM' — R” * IGM" is given by a connecting homomorphism. The proof is almost 
identical with that of Theorem 6.5 and is therefore omitted. 

We now let G = hom(-, N) the contravariant hom functor determined by a fixed R-module 
N. We recall the definition: If M € ob R-mod, then hom(—, N)M = homp(M, N) and if a € 


hom,(M, M’), hom(—, N)(a) is the map a* of hom,(M, N) into hom,(M, N) sending any £ in 
the former into fa e homp(M, N). hom(— ,N)M is an abelian group and a* is a group 
homomorphism. Hence hom(— ,N) is additive. Since (a,a,)* = a*,a,*, the functor hom(— ,N) is 
contravariant. We recall also that this functor is left exact, that is, if M' > M4 M" —> 0 is 
exact, then 0 — hom(M", N) 4 hom(M, N) % hom(M', N) is exact (p. 105). 

The nth right derived functor of hom(-, N) is denoted as Ext”(— ,N); its value for the module 
M is Ext"(M, N) (or Extp"(M, N) if it is desirable to indicate R). If C, € is a projective 
resolution for M, then the exactness of C} > Cy + M — 0 implies that of 0 > hom(M, N) 5 
hom(Cy, N) —> hom(C,, N). Since Ext°(M, N) is the kernel of the homomorphism of hom(Cp, N) 
into hom(C;, N) it is clear that 


(23) Ext°(M, N) = hom(M, N) 


under the map e* 
Now let 0 — M'— M — M" — 0 be a short exact sequence. Then we obtain the long exact 
sequence 


() > Ext?(M", N) > Ext®(M,N) > Ext°(M’, N) 
> Ext'(M”,N) > Ext!(M, N) >+. 


If we use the isomorphism (23), we obtain an imbedding of the exact sequence 0 — hom(M", 
N) > hom(M, N) — hom(M’, N) in a long exact sequence 


0 > hom(M”, N) > hom(M, N) > hom(M’, N) 
> Ext'(M",N) > Ext!(M,N) >+. 


(24) 


(25) 


We can now prove 


THEOREM 6.6. The following conditions on a module M are equivalent: 

(1) M is projective. 

(2) Ext"(M, N) =0 for all n > 1 and all modules N. 

(3) Ext! (M, N) = 0 for all N. 
Proof. (1) = (2). If M is projective, then 0 — M & CG =M<«0<... is a projective 
resolution. The corresponding complex to calculate Ext”(M, N) is 0 — hom(M, N) — hom(M, 
N) > 0 —> .... Hence Ext"(M, N) = 0 for all n > 1. (2) = (3) is clear. (3) = (1). Let M be any 
module and let 0 — K > P 4 M — 0 be a short exact sequence with P projective. Then (25) 
and the fact that Ext! (P, N) = 0 yield the exactness of 


(26) 0-—hom(M,N)- hom(P,N)— hom(K,N)— Ext!(M,N)- 0. 


Now assume Ext! (M, N) = 0. Then we have the exactness of 0 > hom(M, N) > hom(P N) > 
hom(K, N) — 0, which implies that the map 7* of hom(P. N) into hom(K, N) is surjective. Now 
take N = K. Then the surjectivity of 7* on hom(K,K) implies that there exists a ¢ € hom(P K) 
such that J, = <j. This implies that the short exact sequence 0 — K 5 — P 4 M — 0 splits. 


Then M is a direct summand of a projective module and so M is projective. D 


The exact sequence (24) in which 0 — K 5 P 4 M — 0 is exact, M is arbitrary and P is 
projective gives the following formula for Ext! (M, N): 


(27) Ext' (M, N) = coker n* = hom(K, N)/im(hom(P, N)). 


We shall use this formula to relate Ext! (M, N) with extensions of the module M by the module 
N. It is this connection that accounts for the name Ext for the functor. 
IfM and N are modules, we define an extension of M by N to be a short exact sequence 


(28) 0>N5E5M50. 


For brevity we refer to this as “the extension Two extensions Æ} and E, are said to be 
equivalent if there exists an isomorphism y: E; — E, such that 


(29) 


is commutative. It is easily seen that if y is a homomorphism from the extension Æ} to E, 
making (29) commutative, then y is necessarily an isomorphism. Equivalence of extensions is 


indeed an equivalence relation. It is clear also that extensions of M by N exist; for, we can 
construct the split extension 


(30) 0+N+M@N5M=0 


with the usual i and p. 

We shall now define a bijective map of the class E(M, N) of equivalence classes of 
extensions of M by N with the set Ext!(M, N) More precisely, we shall define a bijective map 
of E(M, N) onto coker 7* where 0 — K 4 P 4 M — 0 is a projective presentation of M and 7* 
is the corresponding map hom(P N) — hom(K, N) (y*(A) = ày). In view of the isomorphism 
given in (27), this will give the bijection of E(M, N) with Ext! (M, N). 

Let0 — N+ E4 M — 0 be an extension of M by N. Then we have the diagram 


K y P 


M 
N a E B 


without the dotted lines. Since P is projective, there is a homomorphism 4: P — E making the 
triangle commutative. With this choice of 4 there is a unique homomorphism u :K — N making 
the rectangle commutative. For, if x € K, then Ayx = enx = 0. Hence Ayx e ker p and so there 
exists a unique y € N such that ay = Ayx. We define u by x ~ y. Then it is clear that u is a 
homomorphism of K into N making the rectangle commutative and u is unique. Next, let 2’ be a 
second homomorphism of P into E such that p4' = e. Then f(A’ — 4) = 0, which implies that 
there exists a homomorphism t:P — N such that 2’— 2 = at. Then A'y = (À + at)yn = añ + tn). 
Hence u' = u + ty makes 


K q P 
p A’ 
N a E 


commutative. Since t E€ hom (PAN), ty € imy*. Thus u and u' determine the same element of 
coker y* and we have the map sending the extension F into the element 7 + imy* of coker *. 
It is readily seen, by drawing a diagram, that the replacement of E by an equivalent extension £ 
‘yields the same element u+im 7*. Hence we have a map of E(M, N) into coker 7*. 
Conversely, let u € hom(K, N). We form the pushout of y and 7 (exercise 8, p. 37). 


Explicitly, we form N ® P and let J be the submodule of elements (— u(x), n(x)), x € K. Put E 
= (N ® P)/I and let a be the homomorphism of N into E such that a(y) = (y, 0) + J. Also we 
have the homomorphism of N ® P into M such that (y, z) ~ €(z),y € N,z € P. This maps J into 
0 and so defines a homomorphism f} of E = (N ® P)/I into M such that (y, z) + I ~ e(z). We 
claim that 0 => N+ E 4 M — 0 is exact. First, if a(y) = (y, 0) + 7 = 0, then (y, 0) = (— u(x), 
n(x)), x € K, so n(x) = 0 and x = 0 and y = 0. Thus a is injective. Next, Bay = B((y, 0) + D = 
e(0) = 0, so (Ga = 0. Moreover, if p(y, z) + J) = 0, then e(z) = 0 so z = y(x), x e K. Then (y, z) 
+7=(y+ u(x), 0) +7 =a(y + u(x). Hence ker p = ima. Finally, J is surjective since ifu € M, 
then u = e(z), z e P, and f((0, z) + J) = e(z) = u. If we put A(z) = (0, z) + J, then we have the 
commutativity of the diagram (31). Hence the element of coker 7* associated with the 
equivalence class of the extension FE is u + imy*. This shows that our map is surjective. It is 
also injective. For, let E be any extension such that the class of E is mapped into the coset u + 
imy*. Then we may assume that E ~ u under the original map we defined, and if we form the 
pushout E’ of u and 7, then £’ is an extension such that E’ ~ u. Now since E' is a pushout, we 
have a commutative diagram (29) with E; = E' and E, = E. Then E and E’ are isomorphic. 
Evidently, this implies injectivity. 
We state this result as 


THEOREM 6.7. We have a bijective map of Ext! (M, N) with the set EM, N) of 
equivalence classes of extensions of M by N. 


We shall study next the dependence of Ext”(M, N) on the argument N. This will lead to the 
definition of a functor Ext’(M, —) and a bifunctor Ext”. Let M, N, N’ be R-modules and £ a 
homomorphism of N into N’. As before, let C = {C;}, e be a projective resolution of M. Then 


we have the diagram 


()——> hom(M, N) ———> hom(C,», N) ——>": 


(32) | l 


0——— hom(M, N'):—— hom (Ca, N) |Á: 


where the horizontal maps are as before and the vertical ones are the left multiplications Bz by 
p. It is clear that (32) is commutative and hence we have homomorphisms of the complex 
hom(C, N) into the complex hom(C, MN’); consequently, for each n > 0 we have a 


homomorphism $” of the corresponding cohomology groups. Thus we have the homomorphism 


b": Ext”(M, N) > Ext”(M, N’). It is clear that this defines a functor Ext”(M, —) from R-mod to 
Ab that is additive and covariant. 
Ifa € hom(M', M), we have the commutative diagram 


hom(M,N) ——> hom(M’,N) 


| 
hom(M,N’)——— hom(M',N’), 


which gives the commutative diagram 


Ext"(M,N) ——> Ext"(M’, N) 
l | 
Ext"(M, N') > Ext"(M', N’). 


This implies as in the case of the hom functor that we can define a bifunctor Ext” from R-mod 
to Ab (p. 38). 

Now suppose that we have a short exact sequence 0 — N’— N — N" — 0. As in (32), we 
have the sequence of homomorphisms of these complexes: hom(C, N) — hom(C, N) - hom(C, 
N”). Since C, is projective and 0 — N’ — N—- N" — 0 is exact, 0 — hom(C;, N’) > hom(C;, 
N) — hom(C;, N") — 0 is exact for every i. Thus 0 — hom(C, N’) — hom(C, N) — hom(C, N 
") — 0 is exact. Hence we can apply Theorem 6.1 and the isomorphism of hom(M, N) with 
Ext®(M, N) to obtain a second long exact sequence of Ext functors: 


(33) 0—hom(M,N’) > hom(M,N) > hom(M, N") > Ext'(M,N’) > 
Ext'(M, N) > Ext'(M,N") 3°". 


We shall call the two sequences (24) and (33) the long exact sequences in the first and 
second variables respectively for Ext. 

We can now prove the following analogue of the characterization of projective modules 
given in Theorem 6.6. 


THEOREM 6.8. The following conditions on a module N are equivalent: 

(1) N is injective. 

(2) Ext"(M, N) =0 for all n > 1 and all modules M. 

(3) Ext! (M, N) = 0 for all M. 
Proof. (1) = (2). If N is injective, the exactness of 0 — M <— Cy < Cı + ... implies that of 
0 — hom(M, N) — hom(Co, N) — hom(C), N) — .... This implies that Ext”(M, N) = 0 for all 
n > 1. The implications (2) = (3) are trivial, and (3) = (1) can be obtained as in the proof of 


Theorem 6.6 by using a short exact sequence 0 — N— Q —> L — 0 where Q is injective. We 
leave it to the reader to complete this argument. 0 

The functors Ext”(M, —) that we have defined by starting with the functors Ext”(— ,N) can 
also be defined directly as the right derived functors of hom(M, —). For the moment, we denote 
the value of this functor for the module M as Ext”(M, N). To obtain a determination of this 
group, we choose an injective coresolution 0 > N 5 D° 4 D! — ... and we obtain the cochain 
complex hom(M, D):0 > hom(M, D°) > hom(M, D!) > .... Then Ext”(M, N) is the nth 
cohomology group of hom(M, D). The results we had for Ext can easily be established for Ext. 
In particular, we can show that Ext°(M, N) = hom(M, N) and we have the two long exact 
sequences for Ext analogous to (24) and (33). We omit the derivations of these results. Now it 


can be shown that the bifunctors Ext” and Ext” are naturally equivalent. We shall not prove this, 
but shall be content to prove the following weaker result: 


THEOREM 6.9. Ext"(M, N) = Ext” (M, N)for all n, M, and N. 


Proof. Ifn =0, we have Ext%(M, N) = hom(M, N) = Ext®(M, N). Now let 0 > K > P > M 
— 0 be a short exact sequence with P projective. Using the long exact sequence on the first 
variable for Ext and Ext! (P N) = 0 we obtain the exact sequence 0 > hom(M, N) > hom(P 
N) > hom(K, N) > Ext!(M, N) — 0. This implies that Ext! (M, N) = hom(K, N)/im hom(P. N). 
In (27) we showed that Ext!(M, N) = hom(K, N)/im hom(P N). Hence Ext!(M, N) = Ext!(, 
N). Now let n > 1 and assume the result for n — 1. We refer again to the long exact sequence on 
the first variable for Ext and obtain 


0 = Ext"~!(P, N) > Ext"~!(K, N) > Ext"(M, N) > Ext"(P, N) = 0 


from which we infer that Ext’(M, N) = Ext™ |(K, N). Hence Ext"(M, N) = Ext”-/(K, N) = Ext 
(K, N). The same argument gives Ext"(M, N) = Ext™ ! (K, N). Hence Ext"(M, N) = Ext”(M, 
N).0 


EXERCISES 


1. Let R = D, a commutative principal ideal domain. Let M = D/(a), so we have the 
projective presentation 0 — D— D — M — 0 where the first map is the multiplication 
by a and the second is the canonical homomorphism onto the factor module. Use (27) 
and the isomorphism of homp(D, N) with N mapping y into 71 to show that Ext!(M, N) = 
N/aN. Show that if N = D/(b), then Ext! (M, N) = D/(a, b) where (a, b) is a g.c.d. of a 
and b. Use these results and the fundamental structure theorem on finitely generated 


modules over a p.i.d. (BAI, p. 187) to obtain a formula for Ext!(M, N) for any two 
finitely generated modules over D. 


2. Give a proof of the equivalence of Ext,(M, —) and Ext"(M, —). 


3. Show that in the correspondence between equivalence classes of extensions of M by N 
and the set Ext’ (M,N) given in Theorem 6.7, the equivalence class of the split extension 
corresponds to the 0 element of Ext! (M, N). 


4. Let NŠ E; “» M, i = 1, 2, be two extensions of M by N. Form E, ® E, and let F be the 
submodule of E; ® E, consisting of the pairs (z;, Z2), z; € E; such that B,z, = f5z>. Let K 
be the subset of E; ® E, of elements of the form (ay, — azy), y € N. Note that K is a 
submodule of F. Put E = F/K and define maps a: N > E, p: E — M by ay = (ayy, 0) + 
K = (0, -a xy) + K, f(z), z) + K) = a Zz, = pz). Show that a and p are module 


homomorphisms and N *> E 4 M, so we have an extension of M by N. This is called the 
Baer sum of the extensions N *+ E; & M. Show that the element of Ext’ (M, N) 


corresponding to the Baer sum is the sum of the elements corresponding to the given 
extensions. Use this and exercise 3 to conclude that the set of equivalence classes of 
extensions of M by N form a group under the composition given by Baer sums with the 
zero element as the class of the split extension. 


6.8 TOR 


IfM € mod-R, the category of right modules for the ring R, then M®p is the functor from R- 
mod to Ab that maps a left R-module N into the group M®,N and an element 7 of homp(N, N’) 
into 1@n. M®p is additive and right exact. The second condition means that if N' — N — N" 
— 0 is exact, then M@N' —> M@N — MƏN" — 0 is exact. The nth left derived functor of M® 
(= M®,) is denoted as Tor,(M, —) (or Tor®,(M, —)). To obtain Tor,(M, N) we choose a 
projective resolution of N: 0 — N 5 Co < ... and form the chain complex M@C = {M®@C;}}. 
Then Tor,(M, N) is the nth homology group H,(M®@C) of M8C. By definition, Torg(M, N) = 
(M®C))/im(M®C_). Since M® is right exact, M®C, — M®C, — MƏN — 0 is exact and 
hence M ® N = (M 8 Cp)/im(M ®C,) = Tor p(M, N). 

The isomorphism Torg(M, N) = M®N and the long exact sequence of homology imply that if 
0 — N'—> N —> N" Q is exact, then 


384)  0-M8N"-MQ8QN -MQ N' -Tor (M,N")—Tor, (M,N) =: 


is exact. 

We recall that a right module M is flat if and only if the tensor functor M® from the 
category of left modules to the category of abelian groups is exact (p. 154). We can now give a 
characterization of flatness in terms of the functor Tor. The result is the following analogue of 
Theorem 6.6 on the functor Ext. 


THEOREM 6.10. The following conditions on a right module M are equivalent: 
(1) M is flat. 
(2) Tor (M, N) = 0 for all n> | and all (left) modules N. 
(3) Tor (M, N) =0 for all N. 


Proof. (1) = (2). IfM is flat and 0 — N — Co +— Cı + ... is a projective resolution of N, 
then 0 — M@®N — M®C, — M8C, + ... is exact. Hence Tor,(M, N) = 0 for any n = 1. (2) 
=> (3) is clear. (3) = (1). Let 0 — N' — N'N"'0 be exact. Then the hypothesis that Tor; (M, N 
') = 0 implies that 0 — M@®N — — MƏN — MƏN" — Ois exact. Hence M is flat. D 


We consider next the dependence of Tor (M, N) on M. The argument is identical with that 


used in considering Ext”. Let a be a homomorphism of the right module M into the right module 
M' and as before let 0 <—_N< Co — Cı — ... be a projective resolution for the left module N. 


Then we have the commutative diagram 


0—M @N—M 8CM OC, — 
(35) 1 | 
0+-M'@N—M’@Cyo+M'@C,:« 


where the vertical maps are a@1y, a@1¢,, a®1¢,, etc. Hence we have a homomorphism of the 


complex {M@C; into the complex and a corresponding homomorphism of the homology groups 
Tor,(M, N) into Tor (M', N).In this way we obtain a functor Tor,(—, N)> from mod-R, the 
category of right modules for the ring R, to the category Ab that is additive and covariant. 


We now suppose we have a short exact sequence of right modules 0 — M'— M— M" > 
0 and as before, let C, e be a projective resolution for the left module N. Since the C, are 


projective, 0 > M' @C; — M®C; M"®C, — 0 is exact for every i. Consequently, by Theorem 
6.1 and the isomorphism of Toro(M, N) with M ® N we obtain the long exact sequence for Tor 
in the first variable: 


(36) 0+ M"@N«+M@N«M QN- 
Tor,(M", N) — Tor,(M, N) —Tor,(M’,N)<—:*:. 


Finally, we note that as in the case of Ext, we can define functors Tor, (M, N) using a 


projective resolution of the first argument M. Moreover, we can prove that Jor,(M, N) = Tor, 


„(M, N). The argument is similar to that we gave for Ext and ExtExt and is left to the reader. 


EXERCISES 


1. Determine Tor “(M, N) if M and N are cyclic groups. 
2. Show that Tor “(M, N) is a torsion group for any abelian groups M and N. 


5.9 COHOMOLOGY OF GROUPS 


In the remainder of this chapter we shall consider some of the most important special cases of 
homological algebra together with their applications to classical problems, some of which 


provided the impetus to the development of the abstract theory. 

We begin with the cohomology of groups and we shall first give the original definition of 
the cohomology groups of a group, which, unlike the definition of the derived functors, is quite 
concrete. For our purpose we require the concept of a G-module, which is closely related to a 
basic notion of representation theory of groups. If G is a group, we define a G-module A to be 
an abelian group (written additively) on which G acts as endomorphisms. This means that we 
have a map 


(37) (g,x) > gx 
of G x A into A such that 


g(x+y) = gx+gy 


(38) (9192) = g1 (92X) 


Ix =x 
for g, g), 27 € G, x,y E€ A. As in representation theory, we can transform this to a more familiar 


concept by introducing the group ring Z[G], which is the free Z-module with G as base and in 
which multiplication is defined by 


(39) (So,9) (XB,h) = Xa, brgh 


where ag, Pp € Z. Then if A is a G-module, A becomes a Z[G]-module if we define 


(40) (Sayg)x = Ea lgx): 


The verification is immediate and is left to the reader. Conversely, if A is a Z[G]-module, then 
A becomes a G-module if we define gx as (1g)x. 

A special case of a G-module is obtained by taking A to be any abelian group and defining 
gx =x for all g e G, x e A. This action of G is called the trivial action. Another example of a 
G-module is the regular G-module A = G[Z] in which the action is AZ a42) = } aghg 

Now let A be a G-module. For any n = 0, 1,2, 3,..., let C’(G, A) denote the set of functions 
ofn variables in G into the module A. Thus if n > 0, then C”(G, A) is the set of maps of 
GxGx-*G into A and ifn = 0, a map is just an element of A. C"(G, A) is an abelian group 
with the usual definitions of addition and 0: Iff, f € C"(G, A), then 


(FAA) (Gis-++sIn) =S Oivi 0n) tS Gis +++sIn) 
Olgi- -3 9n) = 9. 


In the case of C°(G, A) = A, the group structure is that given in A. 


We now define a map 8( = 56,,) of C,(G, A) into C” * 1(G, A). Iff e C" (G,A), then we define 
of by 


Of (915+++59n41) = G1 f Gas +++s9n41) 


n 


(41) = pi (IY gi- 9i-19i9i+i gn) 


+D S Oin) 


For n = 0, f is an element of A and 


(42) Òf (91) = g f- Í. 


For n = 1 we have 


(43) Of (91:92) = 91 f (92) -f (9192) +f (91) 


and for n = 2 we have 


(44) Of (91:92:93) = 91 f (9293) —f (9192593) +f (9159293) -S (91:92) 


It is clear that ô is a homomorphism of C" (G, A) into C” + (G, A). Let Z” (G, A) denote its kernel 
and B” + 1 (G, A) its image in C” * 1 (G, A). It can be verified that 6°f = 0 for every f € C"(G, A). 
We shall not carry out this calculation since the result can be derived more simply as a by- 
product of a result that we shall consider presently. From ô(ôf) = 0 we can conclude that Z” (G, 
A) > B"(G, A). Hence we can form the factor group H"(G, A) = Z2"(G, A)/B"(G, A). This is 
called the ftth cohomology group of G with coefficients in A. 

The foregoing definition is concrete but a bit artificial. The special cases of H'(G, A) and 
H(G, A) arose in studying certain natural questions in group theory that we shall consider in 
the next section. The general definition was suggested by these special cases and by the 
definition of cohomology groups of a simplicial complex. We shall now give another definition 
of H"(G, A) that is functorial in character. For this we consider Z as trivial G-module and we 


consider Ext’(Z, A) for a given G-module 4. We obtain a particular determination of this group 
by choosing a projective resolution 


(45) jezi EG 


of Z as trivial Z[G]-module. Then we obtain the cochain complex 
(46) 0 + hom(Cy, A) > hom(C,, A) >+ 


whose nth homology group is a determination of Ext’(Z, A). 


We shall now construct the particular projective resolution (45) that will permit us to 
identify Ext"(Z, A) with the nth cohomology group H"(G, A) as we have defined it. We put 


n+l 


(47) C, = ÍG]&z ®2Z[G]. 


Since Z[G] is a free Z-module with G as base, C, is a free Z-module with base g 8 g8 ... @ 
Z„ g; E G. We have an action of G on C, defined by 


(48) g(Xy® *** @Xq) = 9X08 XA t @X,, 


which makes C, a Z[G]-module. This is Z[G]-free with base 


(49) (gign) =1@g9,@'''@g,, giEG. 


We now define a Z[G]-homomorphism d, of C, into C, — , by its action on the base {(g), ..., 


Sn)}: 
An(G1+++59n) = 91 (Jas-++s9n) 


n-1 


(50) + y = 1)! 5+ +5 9i-19igi+19i+ 29n) 
1 


+ Inia) 


where it is understood that for n = 1 we have d(e) =g; — 1 € Cy = 2[G]. Also we define a Z 
[G]-homomorphism ¢ of Co into Z by e(1) = 1. Then e(g) = e(g1) = gl = 1 and e(Yia,g) = diag. 
We proceed to show that 0 — z-S Co 4 C} — > is a projective resolution of Z. Since the C; 


are free Z[G]-modules, projectivity is clear. It remains to prove the exactness of the indicated 
sequence of Z[G]-homomorphisms. This will be done by defining a sequence of contracting 
homomorphisms: 


S- S 
Z=>0 >05. 
By this we mean that the s; are group homomorphisms such that 


A A diSn+5_1é = le, 
(51) 1 4 120 1 Co 


da+iSntSn-1d = lc, nèl. 


We observe that {1} is a Z-base for Z, G is a Z-base for Cy = Z[G], and {29(81, ..., Z,) = Zo 
® g, ®  @ gig, € G} is a Z-base for C,, n = 1. Hence we have unique group 


homomorphisms s _ 4 : Z — Cp, Sn : C,_ 1 — C, such that 


(52) Sals l, Sogo = (Yo), 
SnJo(G1s+++s9n) = (JosGis-++s9n)s n> 0. 


Ifn > 0, we have 


Ans 1SnJo(G1s++*s9n) = dnt Goss ++sIn) 


1 
= Iogi ga) +d, (D Oogie 9n) 


i=0 
+(— 1)" go...,9n-1) 
Sn-1dn9olg1 9n) = Sn—1G904n(G1s+++sGn) 


n-1 


= Sn—19091(G2s-+++9n) + a Ki 1)'Sp—1Go(G19+++sGiJit19+++9Gn) 
1 


+(—1)"s,~190(91>-+->9n-1) 
n-i 


= (Orla ga) + 2, (Oobi afilia 
1 


+(— 1)" gis- -9n-1) 


Hence dy + 1858081; +++» Zn) T Sn- 1dp80lE1 <--> Zn) = Zogi --- Zn). This shows that the third 
equation in (51) holds. Similarly, one verifies the other two equations in (51). 

We can now show that the Z[G]-homomorphisms e, d,, satisfy ed, = 0, d,d, +; = 0, n 2 1. By 
(49), C, is free with base {(g) = 1 ® glg € G}. Since ed,(g) = e(g — 1) = 1 — 1 = 0, the first 
equality holds. Thus if we put € = dp, then we have d,d, +1 = 0 for n = 0. We note also that 
s,C, for n > 0 contains the set of generators {(go, Z1; ..., g,)} for C, + 1 as Z[G]-module. 
Hence it suffices to show that d,d,, +15, = 0 ifn > 0, and we may assume d, _ jd, = 0. 


Then 
d,dy+1S, = 4,(1—s,-14,) = d,—(1—S,-2d,-1)dn = S,-24,-14, = O. 


We can regard (45) as a Z-complex and the sequence of maps s_, Sọ, S}, ... aS a homotopy 
between the chain homomorphism of this complex into itself that is the identity on every C; 


with the chain homomorphism that is 0 on every C;. Then these chain homomorphisms define 


the same homomorphisms of the homology groups. It follows that the homology groups of the Z- 
complex (45) are all 0. This means that (45) is exact and hence we have proved 


tt i 
THEOREM 6.11. Let C, = FLG] @2L6], n > 0, and let e : Cy — Z be the 2[G)- 
homomorphism such that €l = 1, d, : C, —> C, _ 1, the Z[G]- homomorphism such that (50) 
holds. Then C= {C,,}, and e constitutes a free resolution for 2 regarded as trivial G-module. 


To calculate Ext”(Z, A) for any Z[G]-module we can use the resolution C, e. Then Ext”(Z, A) 
is the nth homology group of the complex 


(53) 0 = hom(Co, A) + hom(C,, A) =". 

Since {(g), ..., Z,)lg; € G} is a Z[G]-base for C,, we have a bijection of hom(C,,, A) with the 
set C"(G, A) of functions of n variables in G into A, which associates with any f e hom(C,,, A) 
its restriction to the base {(g), ..., g,)}. The map hom(C,, 4) — hom(C, + 1, A) is right 
multiplication by d, + 1; that is, if fis a Z[G]-homomorphism of C, into A, then its image under 
hom(C,,, 4) — hom(C;, + ;, A) is the homomorphism x ~ fd, + 1x). If we take x to be the 
element (gj, ..., 2, +1) of C, + 1 then this map is 


ises Outi) %S(G1(G25-++sIn41) + Lí- 1) 15+++sGiPit1s+++s9n+1) 
+(—1)°**G;,.--+da)) 


= gif Uz, Jn+1)) + y (= IS (is. +sGiGia1s+++s9n+1) 
i 


= Of 915+++5In+1): 
Thus we have the following commutative diagram 


hom(C,,, 4) > hom, , ,, A) 
(54) | | 
C"(G, S) ——— C"+! (G,A) 


a 


Ò 


where the vertical arrows are group isomorphisms. Since the product of hom(C,_ ,, 4) > 


hom(C,,, 4) and hom(C,,, 4) > hom(C,, + 1, 4) is 0, we have 6* = 0. Hence 0 > CG, A) > 
C!(G, A) > C*(G, A) —> ` is a cochain complex and this is isomorphic to the cochain complex 
0 — hom(Cp, A) —> hom(C), A) — hom(C,, A) — v. It follows that these two complexes have 
isomorphic homology groups. We therefore have 


THEOREM 6.12. B”(G, A) is a subgroup of Z"(G, A) and H"(G, A) = Z"(G, A)/B"(G, A) = 
Ext”(Z, A). 


We shall now switch from the original definition of the cohomology groups of G with 
coefficients in A to the groups Ext’(Z, A). From now on we use the definition Ext” (Z, A) for the 
nth cohomology group of G with coefficients in A. This definition has the advantage that it 
makes available the functorial results on Ext for the study of cohomology groups of a group. 
Also it offers considerably more flexibility since it permits us to replace the resolution of Z 
that we have used by others. Some instances of this will be given in the exercises. 


We shall now look at the cohomology group H”(G, A) for n = 0, 1, 2. We prove first 


THEOREM 6.13. H(G, A) = A®, the subgroup of A of elements x satisfying gx =x, g € G. 


Proof. We recall that Ext9(M, N) = hom(M, N). Hence HG, A) = homg;g)(Z, A), the group 
of Z[G]-module homomorphisms of Z into A. If y is such a homomorphism, 7 is determined by 
n(1) and if 7(1) =x € A, then x = 7(1) = n(g1) = gy) = gx, g € G. Conversely, if x € A 
satisfies gx = x, g € G, then the map 7 such that y(n) = nx is a Z2[G]-homo- morphism of Z into 
A. It follows that hom(Z, A) is isomorphic (under y ~ (1)) to A. O 


We remark that this proposition can also be proved easily by using the definitions of Z°(G, 
A), B°(G, A), and Z°(G, A)/B°(G, A). We leave it to the reader to carry out such a proof. 
If A is a G-module, a map f of G into A is called a crossed homomorphism of G into A if 


(55) f(gh) =gf(h)+fg), — g,heG. 


Ifx € A, then the map f defined by 


(56) f(g) = 9x-x 


is a crossed homomorphism of G into A since 
gf (h) +f (g) = g(hx —x)+gx—x = ghx—x = f (gh). 


A crossed homomorphism defined by (56) is called principal. It is clear that the crossed 
homomorphisms form an abelian group under addition of maps and that the principal ones form 
a subgroup. Comparison with (42) and (43) shows that the first of these groups is Z'(G, A) and 


the second is B!(G, A). Hence the factor group is (isomorphic to) the first cohomology group of 
G with coefficients in A. 

We have encountered crossed homomorphisms in Galois theory in considering Speiser’s 
equations and their additive analogue (BAI, pp. 297—299). We recall these results, the first of 
which constituted a generalization of Hilbert’s Satz 90. Let E be a finite dimensional Galois 
extension field of the field F, G the Galois group of E/F, so G is finite and |G| = [E : F]. We 
have the natural actions of G on the additive group £ of E and on the multiplicative group E* of 
non-zero elements of E. If we consider the additive group of E as G-module, then a crossed 
homomorphism is a map f of G into E such that fgh) = f(g) + gh). Theorem 4.32 of BAI (p. 
297) states that any such crossed homomorphism is principal. Thus we have the result H'(G, 
E) = 0: The first cohomology group of the Galois group G of E/F with coefficients in E is 0. 
Now consider the action of G on E*. Since the composition in E* is multiplication, a crossed 
homomorphism of G into E* is a map f of G into E* such that 


(57) I (gh) = (gf (h))f (g). 


These are Speiser’s equations as given in (75), p. 297 of BAI. Speiser’s theorem is that such 
an f is principal, that is, it has the form f(g) = (gu)u ! for some u € E*. Thus Speiser’s 
theorem is the homological result that H!(G, E*) = 1 (using multiplicative notation). 

If G is a group and 4 is an abelian group written multiplicatively on which G acts by 
automorphisms, then the group C?(G, A) is the group of functions of two variables in G to A 
with multiplication as composition. Z*(G, A) is the subgroup offe C7(G, A) such that 


(58) Fg h)F (gh, k) = (gf (h,k))f (g,hk),  g,h,keG. 


This is clear from (44). Such a map is called a factor set. The subgroup B?(G, A) is the 
subgroup of maps of the form f where f(g, h) = u(g)gu(h)u(gh) ! where u is a map of G into A. 
The group Z*(G, A)/B7(G, A) is the second cohomology group of G with coefficients in A. We 
shall give an interpretation of this group in the next section. 

We shall conclude this section by proving the following result on cohomology groups of 
finite groups. 


THEOREM 6.14. Jf G is a finite group, A a G-module, then every element of H"(G, A), n > 
0, has finite order a divisor of |G]. 


Proof. Let fe C"(G, A) and consider the formula (41) for ôf. We let g, +] range over G and 
sum the corresponding formulas. If we denote 26/1919) by u(g,, ..., Z, — 1), then since 
Local Giy-+-Ga- 1909) = “MGs+----Gu-1), the result we obtain is 


n-i 
Y Ofis- -9n 9) = G1U(G25-065Gn) + X, (HL WG, -99i 19n) 


g i=1 


+(—1)"u(9y,---59n-s)t(— D" IGS Ois- Gn) 


= Ju(gy,--6s9n) +(— 1)" IGS iga). 


Hence if ôf = 0, then |G| f(g, ..., g,) = + du(g, ..., Zn) E B'(G, A). Then |GIZ"(G, A) c BG, 
A), so |G|H"(G, A) = 0, which proves the theorem. O 


An immediate consequence of this result is the 


COROLLARY. Let G be a finite group, A a finite G-module such that (\G\, |A) = 1. Then 
H"(G, A) =0 for every n> 0. 


This is clear since |A| f= 0 for every fe C"(G, A). 


EXERCISES 


1. Let B be a right module for Z[G]. Define the nth homology group of G with coefficients 
in B,n = 0, as H,(G, B) = Tor,(B, Z) where Z is regarded as a trivial G-module. Show 
that H(G, B) = B/Bg where Bg is the subgroup of B generated by the elements xg — x, x 
E B. 


2. Let be the homomorphism of Z[G] into Z defined in the text and let / = ker e. Show that 7 
is a free Z-module with base {g -lle € G,g #41}. 


3. Let A and B be G-modules. Show that A @7B is a G-module with the action such that g(x 
@ y) = gx ® gy. 
4. Let A be a G-module and let 4, denote the abelian group A with the trivial G-action. 


Show that Z[G] ®z A and Z[G] ®z A, are isomorphic as G-modules. (Hint: Show that 


there is a module isomorphism of Z[G] ®z A onto Z[G] @z A, such that g@x — g@g ! x, 


ge G,xEA.) 


5. Use exercise 4 to prove that if A is a G-module that is Z-free, then Z[G] ®z A is Z[G]- 
free. 


6. Let I = ker e, € : Z[G] — Z, as in the text and put "5198:19: 2", n factors. Show that the 
short exact sequence 9+! ~“1¢)~2~°®, where the first map is inclusion, yields a short 
exact sequence ~M = Z[6]®-F—= F0. Let d, be the homomorphism of @[6]@z!" into 2[G]@zr""' 


5.10 


that is the composite of 2[6]@z!" + land r" ~ 2[G]@,!""', Note that 
dlg -1)®-* @G,-1)) = g- 1) @G,-1), 
g; € G. Show that 
0+-Z2¢7[G]“Z[G]@l<Z[Glor—- 


is a free resolution for Z. 


. Let G = @, the cyclic group of finite order m generated by the element g. Note that 


ZEG] = ZEN- ¢ an indeterminate. Let D = g — 1, N=1+ g+ +g”- l, and let D', N' 
denote multiplication by D and N respectively in Z[G]. Show that 


0-Z+Z[G]“Z[G]-Z[G] 2: 


is a free resolution for Z. 


. Use exercise 7 to show that if A is a G-module for the cyclic group of order m < œ, then 


H?"G,A)= A/NA, n>0Q, 
H*"+!(G,A) = (Ann, N)/DA, n> 0, 
where Ann,N= {x € A|Nx =O}. 


EXTENSIONS OF GROUPS 


By an extension of a group G by a group A we shall mean a short exact sequence 


(59) 


lA SE SG ot 


Thus 7 is injective, p is surjective, and ker p = iA. Hence iaae. If1+45£'%G-1 1s a second 
extension of G by A, then we say that this is equivalent to (59) if there exists a homomorphism 
h: E — E' such that 


is commutative. It follows easily, as for modules (p. 348), that in this case h is an 
isomorphism. 


We restrict our attention to extensions of a group G by an abelian group. With such an 
extension, we can associate an action of G on A by automorphisms and an element of the 
cohomology group H?(G, A) where A is regarded as G-module by the action we shall define. It 
will be helpful in these considerations to denote the elements of G by small Greek letters and 
those of A and E by small Latin letters. 

We first define the action of G on A. Leto € G, x € A. Choose an element s € E such that ps 

= g and consider the element s(ix)s~ !. Since i4= £, s(ix)s” | € iA and since i is injective, we 
have a unique element y € A such that s(ix)s~ ! = iy. To obtain y we made a choice of an 
element s € E such that ps = ø. Let s' be a second element such that ps’ = ø. Then p(s's~ ') = 1 
and hence s'™ ! = ia, a € A, and s' = (ia)s. Since iA is abelian, we have 
§(ix)s'~* = (ia)s(ix)s~*(ia)~* = s(ix)s~', Thus the element y is independent of the choice of s and hence 
we can put ox = y. Our definition is 
(61) ox=y where ps=a, sf(ix)s ' = iy. 
It is straightforward to verify that the definition of ox gives an action of G on A by 
automorphisms: (¢1¢2)x = ¢,(o2x), Ix=x, o(xy)=(ex)(ey), Except for the fact that A is written 
multiplicatively, we have defined a G-module structure on A. We shall now call this a G- 
module even though we retain the multiplicative notation in A. 

Our next step is to choose for each o e G an element s, € E such that ps, =o. Thus we have 
a map s : G —> E, o ~ s, such that ps, =o for all o (or ps = 16). 


Let o, t € G and consider the element s, s,s, | of E. Applying p to this element gives 1. 


O ` T OT 


: . Sias 
Hence there is a unique element k, , E€ 4 such that sp Sz So = iko, z or 


(62) 5,5, = (iky See 


Ifp € Galso, then 


: ah. =! E- =" 
Sp(SaS:) = Sy(ikg )Sor os S,(ik,.)8, Sy Sar eA i(k ,,-)S) Sqr 
(ok, UK, a: )Spar a ilp (ka Kya) Spar: 
Similarly, (»50)s: = Kp,okva:)ver. Hence, the associative law in E implies that 


(63) I PP a (Dlg Mis ae 


These relations show that the map k : G x G — A such that (a, t) ~ k, , is an element of Z(G, 


A) as defined in the classical definition of H7(G, A) given at the beginning of section 6.9. 

We now consider the alteration in k that results from changing the map s to a second one s’ 
satisfying ps’ = 1g. Then for any 7€%, pls.s,")= (ps,)(ps,)' = 00" =1, Hence there exists a unique u, 
€ A such that 545; ' =i, ors, = (iu.)s.. Thus we have a map u : o ~ u, of G into A such that 


(64) s% =(iu,)s,, GEG. 


Conversely, if u is any map of G into A, then s' defined by (64) satisfies ps’ = 1g. By (64), we 
have 5,5, = (iu, )s,(iu,)s, = i(u,(ou,) 5,5, = ifta (Otli) ka, t) So = i(Ug (OU) ka ta )Sice . Hence k is replaced by k' where 


(65) kar = Ug(OU Mae ko, 


This shows that k’ and k determine the same element of H?(G, A). Hence the extension 
determines a unique element of H(G, A). 

It follows directly from the definitions that if the extensions !+A4+E£4G-1 and 
|+A5E'%G-1 are equivalent, then they determine the same module action of G on A and the 
same element of H?(G, A). To prove the converse we shall show that the multiplication in Æ is 
determined by the action of G on A and the map k. Let e e E and put °="*®%/= els)" Then 
pf = pe(ps.)' =o '=1, Hence f= ix for a uniquely determined element x € A. Then we have the 
factorization 


(66) =(ix)s,, xe€A, o€G. 


It is clear that the elements x € A, and o € G are uniquely determined by the given element e. 
Now let (iy)S,, where y € A and t € G, be a second element of E. Then 


(67) (ix)s,(iy)s, = i(x(oy))s,8, = i(x(oy)k, -)Sqr: 


Now suppose the extensions 1+A+E£4G-—1 and 1+A+5&%G-—1 determine the same module 
structure and the same element of H?(G, A). Then we can choose maps s and s' of G into E and 
E" respectively such that ps = lo, p's’ = lg and for any o, t €e G we have 
525: = (ike Sor 808. = Cheer, Then we have (67) and “2s: = riley)... Tt follows that the map A 
: (ix)(s,) ~ (ix) (s',) is a homomorphism of E into E’ so that (60) is commutative. Hence the 


extensions are equivalent. 
We shall now state the following basic 


THEOREM 6.15. Two extensions of G by an abelian group A are equivalent if and only if 


they determine the same action of G on A and the same element of H(G, A). Let G be a 
group, A a G-module, and let M denote the set of extensions of G by A having a given G- 
module A as associated module. Then we have a 1—1 correspondence between the set of 


equivalence classes of extensions of G by A contained in M with the elements of H(G, A). 


Proof. The first statement has been proved. To prove the second, it suffices to show that 
given a G-module A and a map k of G x G into A satisfying (63) there exists an extension 
I= 4A-4E45G-1 whose associated action of G on A is the given one and whose element of H7(G, 
A) is the one determined by k. We put E = A x G, the set of pairs (x, 0), x € A, o € G, and we 
define a multiplication in £ by 


(68) (x,a) (y,t) = (x(ay)k,..,07). 


Then it is immediate from (63) that this multiplication is associative. If we put p = o = 1 in 
(61), we obtain ki 1 ky, = ki, z ky, ;- Hence ky, =k, ;. Then 


(kris 1) (y,1) = (kī,iYki 1) = (9,7), 


so |= kiil) is a left unit in E. To prove that E is a group with 1 as unit, it suffices to show that 
any element (x, o) of E has a left inverse relative to 1 (BAI, p. 36, exercise 10). This follows 
from 


kika a 07") (x,0) = k ka (ox) (0 x)k,-1..1) = (kt). 


Hence Æ is a group. Let ix ~ (xkii. D. p20.) ¢, Then i is a homomorphism of A into E and p is a 
homomorphism of F into G. It is clear that 7 is injective and p is surjective. Moreover, iA = ker 
p. Hence 1+ A+E*G-11is an extension of G by A. To determine the module action of G on A 
determined by this extension, we note that p(l, o) = o so we must calculate 
(L,a)ix(l,a) * = (1, a)(xkit, 1,0)" * = (Lexi) (kiika) We have seen that ki, = ky. , and if we 


puto = t= 1 in (63), we obtain Kaiko = WPkiaKai, Hence k, | = pk,,,. Now put o = p` l rt=pin 


(63). This gives Kar thie= Pkr'Kai so kno kis = Pky dot, Thus we have 
(69) ki, ma kii ko = pk; 1, k tk = (Pkp-',p)kp,1: 
Now (1.0) (kiix, 1) = (ok? i)axk, 16) = (6x,0) and 


(ox, 0) (kr kot. 607 *) = ((ox)o(ky 1)” kaia) keel 
= ((ax)k> alk, ok, e-ty!) = (xii) 


p,p 


(by (69)). Hence the module action is the given one. Now let s be the map o ~ (l, o), so ps = 
1g. We have LOUE = kaot) = kiikan Let), so 5 s, = (ik, -)Sg_-Hence the element of H(G, A) 
associated with this extension is that determined by k. This completes the proof. o 


The foregoing result in a slightly different form is classical. It was proved by Schreier, who 
first considered the problem of extensions as the problem of describing all of the groups that 
have a given normal subgroup A (not necessarily commutative) with given factor group G. 

An extension (59) is said to be split if there exists a group homomorphism s : G — E such 
that ps = 1g. Then (so)(st) = sot for any o, t € Gand hence k, , determined by (62) is 1. Thus 
the element of H(G, A) associated with a split extension is 1. Conversely, if this is the case, 
then we have a map s : G — E satisfying ps = 1g for which the k, , are all 1, which means that 
s 1s a homomorphism. Thus the split extensions are those for which the associated element of 
H(G, A) is 1. Evidently this implies that H?(G, A) = 1 if and only if all extensions of G by A 
with the given G-module structure on A split. By the corollary to Theorem 6.14 (p. 361), this is 
the case if G and A are finite and (|G|, |A) = 1. 

If A=E and E/A = G for arbitrary (not necessarily abelian) A, then we have the extension 
|\--A+E%G-1 where i is the injection of A in E and p is the natural homomorphism of E onto 
G. It is readily seen from the definition that this extension splits if and only if there exists a 
subgroup S of E such that E = SA, S N A = 1. In this case E is said to be the semi-direct 
product of S and A. The result just indicated is that if A is abelian and A and G are finite with 
(|A|, |G|) = 1, then E = SA and SM A= 1 for a subgroup S of E. This result, which was obtained 
by homological methods, can be supplemented by a little group theory to prove the following 
theorem due to H. Zassenhaus. 


THEOREM 6.16. Let E be a finite group, A a normal subgroup of E such that (|A|, |E/A) = 
1. Then E is a semi-direct product of A and a subgroup S. 


Proof. Let |A|=™m, |E/A| =n. It suffices to show that there exists a subgroup S such that |S] = n. 
For, if S is such a subgroup, then S N A is a subgroup whose order divides |S| = n and |A| = m. 
Then S N A= 1. Also since 4=£, SA is a subgroup whose order is a multiple of |S] and |4| and 
so is a multiple of mn = |E]. Let p be a prime divisor of m and let H be a Sylow p-subgroup of 
A. Then H is also a Sylow p-subgroup of E. The set Syl, of Sylow p-subgroups of E is {gHg™ 


llo € E} (BAL p. 80). Since 4=E this is also the set of Sylow p-subgroups of A. Hence if N is 
the normalizer of H in £ then |Syl,| = [E : N] = [A : N N A]. Thus 


LEVINI = |Al/|\N ^ A| 
and hence 
n = [E:A] = |E\/\A| = INI/IN a A| = [N:N ^ A]. 


On the other hand, IN ^ 4\\/4) so (N ^ Al. IN/N ^ Al) = 1 If IN] < |El we can use induction on order to 


conclude that N and hence E contains a subgroup of order n. Now suppose |N| = |E]. Then N = E 
and Ha E. The center Z of H is non-trivial (BAI, p. 76). Since Z is a characteristic subgroup of 
H, Za£ and 4/7 E/Z. Since (E/Z)/(A/Z) = E/A we can apply induction to conclude that E/Z 
contains a subgroup L/Z of order n. Since Z is abelian and |Z] is a power of p, the result we 
established by homology implies that Z contains a subgroup S of order n. This completes the 
proof. o 


EXERCISES 


1. Let 1+4A+E4G-1 be an extension of G by the abelian group 4 and let H be the set of 
equivalences h of this extension : the automorphisms of E that make 


commutative. Show that H is a subgroup of Aut E that contains the inner automorphisms 
by the elements of i4. Show that the latter set is a normal subgroup J of H and H/J = 


H\(G, A). 


2. (Schreier, Eilenberg and MacLane.) Let | 4+ £4G-1 be an extension of G by the group 
A (not necessarily abelian) and let Z be the center of A. Let s be a map o ~ s, of G into E 


such that ps, = o and let g(s,) be the automorphism x ~ y of A such that s,(ix)s,- | = iy. 


Show that if s' is a second map of G into E satisfying ps’, = 0, o € G, then g(s’,) € (Inaut 
A(s) and 9(s',)|Z = o(s,)|Z. Show that o : o ~ (Inaut A)g(s,) is a homomorphism of G 
into Aut A/Inaut A and oc = g(s,)c, c € Z, defines an action of G on Z by automorphisms. 


Show that if o, t € G, then s,s, = i(k, 1)Sor where k, , is a uniquely determined element 
of A. Show that if we put g(o) = 9(s,), then 


(70) ((o)p(t) = I, plat) 


where J, _ is the inner automorphism *~ t.4- , in A. Show that 


(71) koa, = = (9(p)ko Kpa: 


and 
(12) o(1) = li 


Conversely, suppose G and 4 are groups and k is a map of G x G into A and g a map of 


G into Aut A such that (70)—(72) hold. Let E = A x G and define a product in E by 
(73) (x,0) (y,t) = (x((o)y)k,,., 07) 


Show that E is a group with unit 7i.) and that if 7 is the map *~7 6*0 and p is the map 
(x, o) ~ g, then!+-A+E4G-1is an extension of G by A. 

Let 1+A4E4G-1 and !747E 56-1 be extensions of G by A such that the associated 
homomorphisms of G into Aut A/Inaut A and hence the associated actions of G on Z are 
the same. Show that the maps s and s’ (for G to £”) can be chosen so that g(s,) = 9'(s',)\(@ 
' defined analogously to g). Let k’, , be defined by %* =‘. and put 1% = 4-4, Show 
that Ao, tT) € Zand 697/9 is a 2-cocycle for G with values in Z (where the action of 
G is as defined before). Use this to establish a 1—1 correspondence between the set of 
equivalence classes of extensions of G by A, all having a fixed associated 
homomorphism of G into Aut 4/Inaut A with H(G, Z). 


3. Let G be finite and let H*(G, €*) be the second cohomology group of G with coefficients 
in the multiplicative group €* of non-zero elements of € where the action of G on C* is 
trivial. The group H*(G, €*) is called the Schur multiplier of G. Use Theorem 6.14 to 
show that if [y] is any element of H(G, €*), then the representative cocycle y can be 
chosen to have values that are nth roots of unity, n = |G|. Hence conclude that H?(G, €*) 
is a finite group. 

4. Let p be a projective representation of a group G. By definition, p is a homomorphism of 
G into the projective linear group PGL(V) of a finite dimensional vector space V/F 
(exercise 4, p. 256). As in the exercise cited, for each g €e G, let u(g) denote a 
representative of the coset p(g) e PGL(V), so = marwa) where yoo, E F*, the 
multiplicative group of F. Show that 7'4%»4)~*%. is a 2-cocycle of G with coefficients in 
F* where the action of G on F* is trivial. Show that if we make a second choice of 
representatives u'(g) € GL(V) for the p(g), g € G, then the resulting 2-cocycle y’ 
determines the same element of H?(G, F*) as y. Hence show that we can associate with p 
a unique element [y] of H(G, F*). Note that if [y] = 1, then we may take y,; 45 = l, g; € 
G. Then we have u(gig>) = u(g)(g2), SO p is essentially an ordinary representation. In 
this case we shall say that p splits. 


5. Let the notations be as in the last exercise and let A be a subgroup of F* containing the 


Vevey for a particular choice of the cocycle y. Construct the extension E of G by A 


corresponding to y as in the text. Write the elements of E as (g, a) g € G, a e A. Then 
(21, 41)(82, a2) = (8182 Yeig24 142). Note that iA = {(1, a)} is contained in the center of £E. 


Show that “.«)= azta) defines a representation of E acting on V such that #.1) = #), 
6. Let E be an extension of G by A such that iA c Z(G), the center of E. Let à be a 


representation of E acting on a finite dimensional vector space over an algebraically 
closed field F. For g €e G define p(g) to be the element of PGL(V) having representative 


fig. DELIV), Show that p is a projective representation of G. 


Note: The preceding exercises 3—6 give a slight indication of a rich connection between the 
Schur multiplier and projective representations of a finite group. This was developed in three 
papers by Schur. The reader may consult Huppert’s Endliche Gruppen, Springer-Verlag, 
Berlin-Heidelberg-New York, 1967, pp. 628—641 for an account of Schur’s theory, with 
references. 


6.11 COHOMOLOGY OF ALGEBRAS 


The definitions of homology and cohomology modules for an (associative) algebra, which are 
due to Hochschild, are based on the concept of a bimodule for an algebra. Let A be an algebra 
over a commutative ring K. We have defined a (left) module M for A (as K-algebra) to be an 
abelian group written additively that is both a K-module and an A-module in the usual ring 
sense such that k(ax) = (ka)x = a(kx), ke K, a € A, xe M (p. 211). One has a similar 
definition for a right (algebra) A-module M : M is a K-module, a right A-module in the usual 
sense such that k(xa) = (Ax)a = x(ka), k € K,a € A,x € M. Now let A and B be algebras over 
K. Then we define an (algebra) A-B-bimodule M to be a left A-module and a right B-module 
such that the K-module structures given by A and by B are the same and (ax)b = a(xb), a € A, b 
e B,xe M. 

There is a simple device for reducing the study of A-B-bimodules to that of modules for 
another algebra. Let B°P be the opposite algebra of B and form the algebra A@B°? where @ 
stands for ®g. Let M be an A-B-bimodule, x an element of M. Then we have the map of A x B 


into M sending (a, b) ~ axb e M. This is K-bilinear, so we have a K-linear map of A ® B into 
M such that ~~ >axb:. The main point of this is that for a given £4®b¢4@8" and a given x 
€e M we have a well-defined product (24.@))s = LaixheM_ Direct verification shows that this 
renders M an algebra A ® B°P-module. Conversely, if M is given as an A ® B°P-module, then ax 
= (a @1)x, xb = (1 @ b)x, kx = kx fora e A, b € B, k e K, make M an (algebra)A-B-bimodule. 
It is clear from this that we can pass freely from the point of view of A-B-bimodules to that of 
A ® B°?-modules and conversely. 

If M and N are modules for the algebra A, a homomorphism y of M into N is a 
homomorphism of abelian groups satisfying n(Ax) = k(yx), n(ax) = a(yx), k € K, a € A. Since 
kx = (kl)x, the first of these conditions is superfluous, so the notion coincides with that of 
homomorphisms of M into N in the sense of modules for the ring A. On the other hand, it is 
natural to endow hom,(M, N) with a K-module structure rather than just the abelian group 


structure we have considered hitherto. This is done by defining ky, k e K, 7 e hom(M, N) by 
(kn)x = k(yx). It is clear that in this way hom(M, N) becomes a K-module. Similarly, if M is a 
right A-module and N is a left A-module for the algebra A, then M ® ,N is a K-module. In place 


of the usual functors hom,(M, —), hom,( —, N), ® 4N, etc., to the category of abelian groups, we 
now have functors to K-modules. Moreover, these are not only additive but also K-linear in the 


sense that the maps between the K-modules involved in the definitions are K-homomorphisms. 
Similar remarks apply to the derived functors. All of this is quite obvious and would be 
tedious to spell out in detail. We shall therefore say nothing more about it and shall replace 
abelian groups by modules in what follows when it is appropriate to do so. 

We now consider a single algebra A and the category of A-A-bimodules (homomorphisms = 
A-A-homomorphisms). Equivalently we have the category of A®,A°?-modules. We shall now 
write Æ for A ® AP. Evidently A itself is an A-A-bimodule relative to the left and right 
multiplications in A. Thus A is an A°-module in which we have 


(74) (Sa,@b,)x = La;xb; 


for a, x € A, b; e A°P. Evidently A is cyclic as A°-module with 1 as generator. Hence we have 


an A°-homomorphism 


(15) e:2a;®b; > (L4,@b;)! = X a;b; 


of A° onto A. 

We are now ready to define the homology and cohomology modules of an algebra A. Let M 
be an A-A-bimodule (= A°-module). Then we define the nth cohomology module H"(A, M) of A 
with coefficients in M as Ext" ,(A, M) and the nth homology module H,(A, M) of A with 


coefficients in M as Tor,“ (A, M). In both cases A is regarded as A°-module in the manner 
defined above. 

We now assume that 4 is K-free (projective would suffice). We shall define a free 
resolution of A as A°-module such that the determination of H”"(A, M) by this resolution can be 
identified with the original definition of H”(A, M) as given by Hochschild for algebras over 
fields. Let Xo = A@xA, X, = A @x A @x A and, in general, X, = A @p @... @g Awithn + 2 
factors. If M and N are A-A-bimodules, then M®,N is an A-A-bimodule in which a(x®y) = ax® 
y, (x@y)a = xƏya, a E Á, x E M, y € N (Proposition 3.6, p. 135). It follows that X, is an A-A- 
bimodule in which 


A(X9® *** OXq41) = 4X08 X1 i Xn 


76 
l ) (X08 OXn+1)4 = Xo *** OXO Xn 414. 


It is clear from the definitions that ¥o54A®xA=A, X, =A@KA®rASAÆA®A and for 
n> l, X,24°@X.2 as Æ-modules. The isomorphism “*®*=2~*» maps 
(4@bH@x~a@x@b, xeX,-2, Since A is K-free, the X, are K-free. It follows that Xo = 4%, X: = 4"®4 


and X, = 4°®@X,-2 are A°-free. 
It is clear from the usual argument with tensor products that we have a unique K- 
homomorphism d, of X, into X, _ ; such that 


n 


(77) diot e @ X pas) =) (— 1X BW @ XX DMA k 


0 
It is clear from the definition of the left and right A-actions that this is an A-A-bimodule, hence, 
A°-homomorphism of X, into X,,_ ;. Together with the augmentation e: Xọ — A we have the 


sequence of 4°-homomorphisms 
i d 
(78) 0-AHeXy- Xe 


We shall show that this is exact, which will prove that (78) is a free resolution of A as A°- 
module. We obtain a proof of exactness in a manner similar to that given for the complex 
employed in the group case (p. 357). We define a contracting homomorphism 


(79) AFX)3X, 7°", 


that is, a sequence of K-homomorphisms s; such that 
(80) es_,=14, dySots_ye=1y,, day 15, +5,-1d, = 1y, m2. 


We define s,,, n =— 1, to be the K-homomorphism such that 


(81) S(x *°* @Xn41) = 1@xp@°** @X_44- 


Then it follows directly from the definition that (80) holds. As in the group case (p. 358), this 
implies that (78) is exact and hence this is a free resolution of A with e as augmentation. 
For a given A-A-bimodule M we now have the cochain complex 


(82) 0 > hom (Xo, M) > hom 4-(X,,M) > 


whose cohomology groups are the cohomology groups of A with coefficients in M. Now we 
have the sequence of isomorphisms 
homad Xn M) = hom AQ X,_2,M) = hom,4X,— 2 @A*,M) = homad Xn- 2, homad A", M)) (see p. 136) = hom,(X,-2,M)_ 
We can also identify hom,(X,, _ 2, M) with the K- module of n-linear maps of A x ... x A, n 
times, into M. Such a map has A x ... x A as domain and M as codomain and is a K- 
homomorphism of A into M if all but one of the arguments is fixed. Hence the isomorphism 
above becomes an isomorphism onto the K-module C”(A, M) of n-linear maps fof A x ... x A 
into M. We now define for f€ C"(A, M), òfe C**'(A, M) by 


Ofra Rees) =A Osdan) 


n 
+ >, (LTS 04 ys wn Midge MIR ay Minder neyp Raed) 


i=1 
Leis A 
abel — Sise XXt is 
x; € A. Then one can check the commutativity of 
hom 4-(X,,,M) > hom 4-(X,,,,,M) 


(84) | | 
C"(A,M) > C"*'(A,M) 


where the vertical maps are the indicated isomorphisms. It follows that 


(85) 0> C°(A,M) > CHA, M) 9° 


is a cochain complex isomorphic to (82). Hence it has the same cohomology groups and 
consequently we have the following 


THEOREM 6.17. Let A be an algebra over K that is K-free, M an A-A-bimodule, C"(A, M), 
n > 0, the K-module of n-linear maps of A x ... x A, n times, into M. For f € C"(A, M) define 
ôf € C" * (A, M) by (83). Let Z"(A, M) = ker 6 on C"(A, M), B"(A, M) = 6C"~ !(A, M). Then 
B"(A, M) is a submodule of Z"(A, M) and Z"(A, M)/B"(A, M) = H"(A, M), the nth cohomology 
module of A with coefficients in M. 


Although some of the results we shall now indicate are valid without this restriction, we 
continue to assume that A is K-free. Following the pattern of our discussion of the group case, 


we now consider H”(4, M) for n = 0, 1, 2, using the determination of these modules given in 
Theorem 6.17. 

As usual, it is understood that C°(A, M) is identified with the module M. Taking u € M, the 
definition of du gives (du)(x) = xu — ux, x € A. Hence Z°(A, M) is the submodule of M of u 
such that ux = xu, x € A. Since C '(4, M) = 0, we see that H°(4, M) is isomorphic to the 
submodule of M of u such that ux = xu, x € A. 

Next let fe C! (4, M). Then (9) = f0)-foy)+fOoy, %yeA and df= 0 if and only if fis a 
K-homomorphism of A into M such that 


(86) f(xy) = xf (vy) +f (x)y. 


It is natural to call such an fa derivation of A into the A-A-bimodule M. Ifu € M, u determines 
the inner derivation ou such that 


(&7) (du)(x) = xu— ux 


These form a submodule Inder(4, M) of the module Der(A, M) of derivations of A into M. The 
special case n = 1 of Theorem 6.17 gives the isomorphism 


(88) H'(A, M) = Der(A, M)/Inder(A, M). 
Now let fe C(A, M). Then 


(Of) (x,y,z) = xf (y, 2) -f (xy, 2) +f (x, 2) -—f (x, y)z 
and ôf = 0 if and only if 


(89) xf (y,z) +f (x, yz) =f (xy,z) +f (x, y)z, 


x, y, Z € A. The set of fe C’(A, M) satisfying this condition constitutes Z7(A, M). This contains 
the submodule of maps ôg, g € C(A, M) and (dg)(x, y) = xg(v) — g(xy) + g(x)y. The quotient of 
Z’(A, M) by this submodule is isomorphic to H(A, M). 

The second cohomology group of an algebra in the form Z7(A, M)/B?(A, M) made its first 
appearance in the literature in proofs by J. H. C. Whitehead and by Hochschild of a classical 
structure theorem on finite dimensional algebras over a field: the so-called Wedderburn 
principal theorem. We shall give a sketch of a cohomological proof of this theorem, leaving the 
details to be filled in by the reader in a sequence of exercises at the end of the chapter. 

Let A be a finite dimensional algebra over a field F, N= rad A. Then N is a nilpotent ideal 
in A and 4 = A/N is semi-primitive. The Wedderburn principal theorem asserts that if 4 is 
separable in the sense that 4" is semi-primitive for every extension field E/F, then A contains a 
subalgebra S such that A = S + Nand SM N= 0, that is, A= S ® Nas vector space over F (not 
as algebra direct sum!). 

To prove the theorem one first reduces the proof to the case in which M = 0. This is done 
by introducing B = A/M? (N° is an ideal) whose radical is M/M and (N/N’)* = 0. If M + 0, then 
the dimensionality [B : F] < [A : F] so we may assume that the theorem holds for B. It follows 
easily that it holds also for A. 

Now assume M = 0. We can choose a subspace V of A such that A = V @ N. This is 
equivalent to choosing a linear map s of 4 into A such that ps = 14 for p, the canonical map x ~ 
x + N ofA onto å. Then V = så and s is injective. Any element of A can be written in one and 


only one way as s(@)+x. ded, xe, Since N is an ideal and M? = 0, we have the multiplication 
(90) (s(@) +x) (s(b) + y) = s(a)s(b) + s(a)y + xs(b). 
If we define @=sla)x,xa=xsia), then MN becomes an  A-A-bimodule. Since 


ps(a)s(b) = ps(a)ps(b) = ab and ps(ab) = ab. we have 


(91) s(@)s(b) = 5(ab) + f(a, b) 


where /äbeN, The map fe Z(a, N). Replacing s by the linear map t: 4 > A such that pt = 14 
replaces f by a cohomologous cocycle. Moreover, if f= 0 in (91), then S = s(A) is a subalgebra 
such that A = S ® N. Then Wedderburn’s principal theorem will follow if we can prove that 
H(A, N) = 0 for any separable algebra 4 and any 4-4-bimodule N. A proof of this is indicated 
in the following exercises. 


EXERCISES 


— 


. First fill in the details of the foregoing argument: the reduction to the case MŽ = 0, the 
reduction in this case to the proof of H?(4, N) = 0. 

2. Let A be a finite dimensional separable algebra over a field F. Show that there exists an 

extension field E/F such that 4c = M, (E)®M,(E)® = M.E), (The easiest way to do this is to 


use the algebraic closure F of F as defined in section 8.1. However, it can be done also 
without the algebraic closure.) 


3. Use exercise 2 to show that A° = A®,,A° is finite dimensional semi-simple. 


4. Show that any module for a semi-simple artinian ring is projective. 


5. Use exercises 3 and 4 to show that if A is finite dimensional separable, then A is a 


projective A°-module. Hence conclude from Theorem 6.6, p. 347, that H”(4, M) = 0 for 
any n > | and any M. (This completes the proof of the theorem.) 


6. (A. I. Malcev.) Let A= S ® N where N= rad A and S is a separable subalgebra of A. Let 


T be a separable subalgebra of A. Show that there exists a z e N such that (1 — z)7( — 
zy) 'cS. 


7. Let A be an arbitrary algebra and let J = ker e where e is the augmentation A° — A 
defined above. Show that J is the left ideal in A° generated by the elements a® 1 — 1 @a. 


8. Show that A is A°-projective if and only if there exists an idempotent e € A®° such that (a 
@lle=(1 @a)e,ae A. 


6.12 HOMOLOGICAL DIMENSION 


Let M be a (left) module for a ring R. There is a natural way of defining homological 
dimension for M in terms of projective resolutions of M. We say lhat M has finite homological 
dimension if M has a projective resolution C, e for which C, = 0 for all sufficiently large n. In 


this case the smallest integer n such that M has a projective resolution 


0>0>C, > Co >M 0 


is called the homological dimension, h.dim M, of M. It is clear from this definition that M is 
projective if and only if h.dim M = 0. We recall that such a module can be characterized by the 


property that Ext”(M,N) = 0 for all modules N and n > 1. The following result contains a 
generalization of this criterion. 


THEOREM 6.18. The following conditions on a module M are equivalent: 

(1) hdim M <n. 

(2) Ext’ * (M, N) = 0 for all modules N. 

(3) Given an exact sequence 0 N C, > C, -1 —> Co > M — 0 in which every C, k <n, 

is projective, then C, is projective. 
Proof. (1) = (2). The hypothesis is that we have a projective resolution 
0+ 0+ C, >» Co + M = 0.. Then we have the complex 0 — hom(Co, N) —> hom(C,, N) > 
.. > hom(C,, N) —> 0 — .... The cohomology groups of this cochain complex are the terms of 


the sequence Ext?(M, N), Ext (M,N), Evidently we have Ext” * '(M, N) = 0 and this holds for 
all N. 


(2) =(3). If we are given an exact sequence with the properties stated in (3) , we obtain 
from it a sequence of homomorphisms 


03C,7D,>C,-; 7 D,-; 7°" 27 Co 7D) 70 


where D, = 1m(C; —> C, 1) for k > 0, Do = im(Co — M) = M, C} — Dy, is obtained from C, 
— C, — ; by restricting the codomain and D, — C, _ | is an injection. Then 0 > D; —> C, - —> 


n-1 


D, — ; — 0 is exact. Hence the long exact sequence for Ext in the first variable gives the 
exactness of 


Ext'(C,_,,N) > Ext'(D,,N) > Exti* "(D, 4, N) 
— Ext'*!(C,_,,.N) 


for i = 1, 2,..., 1<k<n. Since C, _ , is projective, the first and last terms are 0. Thus Ext (D, 
N) = Ext’ *!(D, _ ,, N) and hence 


Ext!(D,,N) = Ext?(D,_,,N) = °:: = Ext"* (Do, N). 


Assuming (2), we have Ext” *! (Do, N) = 0. Hence Ext!(D,,, N) = 0. Since 0 > C, > C,_1 > 
... is exact, D, = C,,. Thus Ext! (C,,, N) = 0 for all N, which implies that C, is projective. 


(3) +(1). The construction of a projective resolution for M gives at the (n — 1)-st stage an 
exact sequence 0 <- M< Co <... — C, _ , where all of the C, are projective. Let C, = 


ker(C,, _ | — C, - 2). Then we have the exact sequence 0 — M > Co < ... — C, > 0. 
Assuming (3), we can conclude that C, is projective. Then 0 — M — Cyp<-#<C,<0< 0 
+ is a projective resolution, which shows that h.dim M <n. O 


Remarks. The proof of the implication (1) = (2) shows also that if h.dim M < n, then Ext 


K(M, N) = 0 for every k <n and every module N. In a similar manner the condition implies that 
Tor,(M, N) = 0 for all k > n and all N. It is clear also that if h.dim M = n, then for any k < n 


there exists a module N such that Ext*(M, N) £0. 


It is clear from the fact that Ext”( —, N) is an additive functor from R-mod to Ab that Ext 
(M'®M",N) = Extt(M', N)S®Ext*(M”, N), An immediate consequence of this and Theorem 6.18 is 
that M = M'® M" has finite homological dimension if and only if this is the case for M' and M 
". Then h.dim M is the larger of h.dim M’ and h.dim M”. We now consider, more generally, 
relations between homological dimensions of terms of a short exact sequence 0 — M'— M —> 
M" — 0. For any module N we have the long exact sequence 


+++» Ext'(M", N) > Ext"(M, N) > Ext*(M’, N) > Ext**"(M",N) >+. 


Suppose h.dim M < n. Then and hence Ext*(M',N) = Extt*'(M",N) if k>n, Similarly, if h.dim M' 
< n, then Ext“ 1(M", N) = Ext**'(M,N) for k>n and if hdim M" < n, then 
Ext*(M, N) = Ext*(M’, N) for k > n, These relations imply first that if any two of the three modules 
M, M', M” have finite homological dimension, then so hasd the third. Suppose this is the case 
and let h.dim M = n, h.dim M’ = n', h.dim M” = n". It is readily seen that the facts we have 
noted on the Ext’s imply that we have one of the following possibilities: 


” 


Ln<n'n". Then either n =n'=n"orn<n'andn"=n' +1. 
I. n' <n" n’<n. Thenn =n" 
M. n”<n' n" <n. Thenn=n"'. 


From this it follows that ifn > n', then n" = n; ifn < n', then n” = n + 1; and ifn = n', then n” 
< n' +1. We state these results as 


THEOREM 6.19. Let 0 — M'— M — M" — 0 be exact. Then if any two of h.dimM', 
h.dimM, h.dimM" are finite, so is the third. Moreover, we have h.dim M" = h.dim M if h.dim 
M' < h.dim M, h.dim M” = h.dim M' +1 if h.dim M < h.dim M’, and h.dim M" < h.dim M + 1 if 
h.dim M = h.dim M". 


The concept of homological dimension of a module leads to the definition of homological 
dimensions for a ring. We define the left (right) global dimension of a ring K as sup h.dim M 
for the left (right) modules for R. Thus the left (right) global dimension of R is 0 if and only if 
every left (right) R-module is projective. This is the case if and only if every short exact 
sequence of left (right) modules 0 — M'— M — M" — 0 splits (p. 150) and this happens if 
and only if every left (right) module for R is completely reducible. It follows that R has left 
(right) global dimension 0 if and only if R is semi-simple artinian (p. 208). Thus R has left 
global dimension 0 if and only if it has right global dimension 0. Otherwise, there is no 


connection between the left and right global dimensions of rings in general. We shall be 
interested primarily in the case of commutative rings where, of course, there is no distinction 
between left and right modules and hence there is only one concept of global dimension. 

A (commutative) p.i.d. R that is not a field has global dimension one. For, any submodule of 
a free K-module is free (exercise 4, p. 155). Hence if M is any R-module, then we have an 
exact sequence 0 — K — F — M — 0 in which F and K are free. Hence h.dim M < 1 for any 
R-module M and the global dimension of R is < 1. Moreover, it is not 0, since this would imply 
that R is semi-simple artinian and hence that R is a direct sum of a finite number of fields. 
Since R has no zero divisors + 0, this would imply that R is a field, contrary to assumption. 


EXERCISE 


1. Let M be a module over a commutative ring K, L a commutative algebra over K that is K- 
free. Show that h.dim,M = h.dim; M; (h.dim,M = homological dimension as K-module). 


6.13 KOSZUL’/S COMPLEX AND HILBERT’'S SYZYGY THEOREM 


We shall now consider homological properties of the ring r = F[x),..., Xm] of polynomials in 
indeterminates x; with coefficients in a field F. Our main objective is a theorem of Hilbert that 


concerns graded modules for the ring R, graded in the usual way into homogeneous parts. We 
consider first the decomposition R = F ® J where J is the ideal in R of polynomials with 0 
constant term, or, equivalently, vanishing at 0. This decomposition permits us to define an R- 
module structure on F by (a + f/)b = ab for a,b € Ff €e J. Note that this module is isomorphic 
to R/J. An important tool for the study of the homological properties of R is a certain resolution 
of F as R-module, which was first introduced by Koszul in a more general situation that is 
applicable to the study of homology of Lie algebras—see the author’s Lie Algebras, pp. 174— 
185). 

Koszul’s complex, which provides a resolution for F, is based on the exterior algebra E(M) 
for a free module M of rank m over R = F[x,... ,x,,]. We need to recall the definitions and 
elementary facts on exterior algebras that we obtained in Chapter 3 (p. 141). Let K be an 
arbitrary commutative ring, M a K-module, and E(M) the exterior algebra defined by M. We 
recall that M is embedded in E(M) and E(M) is graded so that E(M) = K ® M 8 M?®... We 
recall also the basic universal property of E(M), namely, x? = 0 for every x € M, and if fis a 
K-homomorphism of M into an algebra A over K such that f(x)? = 0, then f can be extended in 
one and only one way to a K-algebra homomorphism of E(M) into A. In particular, the map x ~ 
— x in M has a unique extension to a homomorphism / of E(M) into itself and since x + — x is 
of period two, i7 = 1 E(M). 

We shall call a K-endomorphism D of E(M) an anti-derivation if 


(92) D(ab) = (Da)b + a(Db), 


We require the following 


SQ 
I 
= 


LEMMA 1. Let D be a K-homomorphism of M into E(M) such that 
(93) x(Dx) = (Dx)x,  xeM. 


Then D can be extended in one and only one way to an anti-derivation of E(M). 


fsx f 0 
dii Dx a 


of M into the algebra A = M,(E(M)). The condition on D implies that f is a K-homomorphism 


such that Ax)? = 0. Hence fhas a unique extension to an algebra homomorphism of E(M) into A. 
It is clear that the extension has the form 


Proof. Consider the map 


a 0 
fias Da al ae E(M) 


where a = la and D is a K-endomorphism of E(M) extending the given D. The condition f(ab) 
= f(a) f(b) implies that D is an anti-derivation. The uniqueness of the extension follows from 
the following readily verified facts: 


1. The difference of two anti-derivations is an anti-derivation. 
2. The subset on which an anti-derivation is 0 is a subalgebra. 
3. M generates E(M). 
Now if D, and D, are anti-derivations such that D, | M = D, | M = D, then D} — D, is an anti- 


derivation such that (D; — D,) | M = 0. Since M generates E(M), we have D} — D, = 0 and D, = 
D,. O 


A particular case in which the lemma applies is that in which D = d e M*, the K-module of 
K-homomorphisms of M into K (the dual module of M). Since K c E(M), d can be regarded as 
a K-homomorphism of M into E(M). Since K is contained in the center of E(M), it is clear that 
the condition (dx)x = x(dx) of the lemma is fulfilled. Hence we have the extension d that is an 
anti-derivation of E(M). Since dM c K and d is an anti-derivation, we can prove by induction 
that dM‘ c M! !, i > 1. We prove next that if d e M*, the anti-derivation extension d satisfies 


LEMMA 2. d=0. 


Proof. Itis clear from (90) that D1 = 0 for any anti-derivation. Hence dk = 0 for k e K. Then 
dM = 0 since dM c K. We note next that dx = dx =— d X for x € Mand ifa € E(M) satisfies 
da = — dä, then 4(ax) = —d(a*) follows from the fact that d is an anti-derivation. It follows that da 
=~ da for all a. This relation implies that d?M' = 0, by induction on i. Hence œ = 0. O 


We now have the chain complex 


(94) 0+K& MEM: 


determined by the element d € M*. 
We shall require also a result on change of base rings for exterior algebras. 


LEMMA 3. Let M be a module over a commutative ring K, L a commutative algebra over 
K, and let E(M) be the exterior algebra over M. Then E(M), = E(M,). 


Proof. Since M is a direct summand of E(M), we can identify M; with the subset of E(M); of 
L L 


elements È}®x; l€ L, xe M = E(M), We have x = 0 and hence ¥4tyyx = +x) -x-xi =O for Xi, X; 
e M. This implies that (X; ® x)? = 0 for every YJ, ® x; € Mz. Let J be an L-homomorphism 
of M; into an algebra 4/ L such that /( X )*=0, ¥ € M;. Now 3 becomes an algebra over K 
if we define kä keK, deA, as (kI) (kleL), Ifx e M, then f:x~f08x) is a K-homomorphism of M 
into 4/ K such that f(x)? = 0. Hence this has an extension to a K-homomorphism f of E(M) into 4 
/ K. Then we have the homomorphism 1 ® f of E(M), into (4/ K); and we have the L-algebra 


homomorphism of (4 / K); into 4/ L such that / ® à~ la. Taking the composite we obtain an L- 
algebra homomorphism of E(M), into 4/ L. This maps the element 1 ® x, x € M, into f(x) = J 
(1 ® x). Hence it coincides with the given L-homomorphism / on Mz. Thus we have obtained 
an extension of J to an algebra homomorphism of E(M), into 4/ L. Since M generates E(M), 
M,, generates E(M),. Hence the extension is unique. We have therefore shown that E(M)z, has 
the universal property characterizing E(M), and so we may identify these two algebras. O 


We now specialize K to R = F[x),..., Xm] where F is a field and the x; are indeterminates. 
Let V be an m-dimensional vector space over F, (),..., Ym) a base for V/ F, E(V) the exterior 


algebra determined by V. Then EV) = F1 8 V- v", v""* =0, and V” has the base of () elements 
Vj, «++ Yı, where i) < iy < ... < i, (BAI, p. 415). Let M = Vp. Then by Lemma 3, 
E(M)=R1®@M@-~@M" where M” has the base {y;, ... y; } over R. Thus this module is free of 


rank (") over R. (This can also be seen directly by using the same method employed in the case 
VI F.) Let d be the element of M* = homp(M, R) such that d, =x, 1 <i <m, and let d also 


denote the anti-derivation in E(M) extending d. Then we have the chain complex (94) and we 
wish to show that if € is the canonical homomorphism of R into F obtained by evaluating a 
polynomial at (0,..., 0), then 


is a resolution of F as R-module with ¢ as augmentation. Since F c R c E(M) and E(M) is a 
vector space over F, we can extend e to a linear transformation £ in E(M)/ F so that e(M’) = 0, 
i > 1. Then ek(M) = eR = F and so de = 0. Also since dE(M) c dM + 9’; > ¡M! and dM is the 
ideal in R generated by the elements x; = dy, we have edM = 0 and so edE(M) = 0. Thus we 
have 


(96) ed = 0 = de. 


Also since J; > Mi is an ideal in E(M) and e | R is an F-algebra homomorphism of R into F, 
we have 


(97) elab) = (ea) (eb) 


for a, b e E(M). Thus £ is an F-algebra homomorphism of E(M). 
The proof that (95) is exact is similar to two other proofs of exactness that we have given 
(p. 359 and p. 373). It is based on the following 


LEMMA 4. There exists a linear transformation s in E(M)/ F such that 
(98) sd+ds = 1 —e. 


Proof We use induction on m. If m = 1, then M = Ry and E(M) = R1 ® Ry. Since (1, x, x’,...) is 
a base for R = F[x] over F, (1, x, x7,..., Y, XV, X7y,...) is a base for E(M)/ F. We have dx! = 0, 
dx'y =x'*!7>0, and el = 1, ex’! = 0, ex'y = 0. Let s be the linear transformation in E(M)/ F 
such that s1 = 0, sx! =x!~ ty, sx'y = 0. Then it is readily checked that (98) holds. We note also 
that 


(99) ss=Q=¢65, w=e=Eel, sitis=0 


if / is the automorphism defined before. Now assume the lemma for m — 1 > 0. Let E} be the F- 
subspace of E(M) spanned by the elements ¥:*.x:*7: k = °, and Æ, the F-subspace spanned by the 
elements “°° 5y. where k, 2 0 and 2<4 <i<° <i <m, Then it is clear by looking at bases 
that Æ; is a subalgebra isomorphic to E(M,), M, = Ry, E, is a subalgebra isomorphic to 
E(M>), My = 2">Ry;, and we have a vector space isomorphism (not algebra isomorphism) of 
E; @pE, onto E(M) such that u ® v ~ uv. We note also that E} and £, are stabilized by / and by 
d and the induced maps are as defined in E(M). Using induction we have a linear 
transformation s, in Æ, such that sad + ds, = l — e. Let sı be the linear transformation in £ as 
defined at the beginning of the proof. Since E(M) = = £:@r£2, there exists a unique linear 
transformation s in E(M) such that 


(100) s(uv) = (s,u)v + (eu) (sav) 
for u € E, v € E>. Then 
ds(uv) = (ds,u)v + (is,u) (dv) + (deu) (sav) 


+ (ieu) (dsv) 


(ds, u)v + (1s,u) (dv) + (eu) (dsv) 


sd(uv) = s((du)v + (1u)dv) 


(siduv + (edu) (sav) + (8,1) (dv) + (eru) (sadv) 


= (s,du)v + (s,1u) (dv) + (eu) (szdv). 
Hence 


(ds +sd) (uv) = ((1—e)u)v + (eu) ((1—e)v) 
= uv—(eu)v + (eu)v — (eu) (ev) 


= (1—«) (uv) 
since € is an algebra homomorphism. This completes the proof. o 


We can now prove that the sequence (95) is exact. First, we know that € is surjective, so R 
— F — 01s exact. We have seen also that dM is the ideal in R generated by the x;. Since this is 
the ideal of polynomials vanishing at (0, 0, ..., 0), we have dM = J = ker £. Hence M “RF +0 
is exact. It remains to show that if z; € M’, i > 1, and dz; = 0, then there exists aw;,,¢ M'*! 
such that dw; , ; = Z; We have z; = 1z; = (1 — €)z; = (sd + ds)z; = d(sz;) = dw, w = sz; Now we 


can write ¥=Lo"» we", then dw, € Mİ- ! so dw =z; gives dw; ; =z; Thus (95) is exact. 


We summarize our results in 


THEOREM 6.20. Let R= F[x,..., Xml; x; indeterminates, F a field, and let M be the free 
R-module with base ())1,..-, Ym), E(M) the exterior algebra defined by M. Let d be the anti- 
derivation in E(M) such that dy; = x; 1 < i < m, e the ring homomorphism of R into F such 
that ef = f(0,..., 0). Then dM' c MË- ! and is a free resolution of F as R-module. 


Oe FeRAM M"A 0 


We call this resolution the Koszul resolution of F as jR-module and the complex 


M"(=R)—M'—M* =~ the Koszul complex for R. 


Since M* = 0 for k > m, we evidently have h.dim F < m. We claim that, in fact, h.dim F = m. 
This will follow from one of the remarks following Theorem 6.18, by showing that Tor,,*(F, 


F) £0. More generally, we determine Tor,*(F, F) by using the Koszul resolution of the second 
F. Then we have the complex 


(101) 0+ F@pR-FQ,M—F@pM? + 


whose homology groups give Tor ¿Ê(F, F), Tor *(F, F),.... Now M'=R@,V' where V is the m- 


dimensional vector space over F (as above) and * ©r M =F 8r (R ©; V) S (FOR); V S FO, V S V, 
If we use the base Wilh <i: < <4) for M. i<r<m, as before, we can follow through the 
chain of isomorphisms and see that the isomorphism of F ®pM" onto V"/ F sends #8. y: for 
ae Fife R into vi f= FO... The definition of d gives 

r 

’ eee — itii re , E Y oea eee 

(102) e me a a t Ve Vay Pee 

1 
Hence 84) (1® 1y: y.) >0 under our isomorphism. Thus the boundary operators in (101) are all 


0 and the isomorphism F ®g M” — V” gives the F-isomorphism 


(103) Tor,¥(F, F) = 


In particular, Tor, ’(F. ,P)=V =F. 

The result just proved that h.dimF = m implies that R = F[x1,..., Xm] has global dimension > 
m. It is not difficult to supplement this result and prove that the global dimension of R is 
exactly m. We shall indicate this in the exercises. In the remainder of the section we shall 
consider a somewhat similar result on free resolutions of graded modules for R that is due to 
Hilbert. Of course, Hilbert had to state his theorem in a cruder form than we are able to, since 
the concepts that we shall use were not available to him. 

We recall first the standard grading of R = F[x),..., Xn] as 


(104) F[xi Xn] = RO(=FI)OROORO G 


where R is the subspace over F of i-forms, that is, F-linear combinations of monomials of 
total degree i in the x’s. We have RORO) c RE +), 

If R is a graded ring, graded by the subgroups R® of the additive group 
(R = ROR", RRP = R"*») then an R-module M is said to be graded by the subgroups M®, i 
=0, 1,..., of its additive group if Y = “OM OM"@ and ROM c MC +D for every i, j. The 
elements of M are called homogeneous of degree i. A submodule N of M is called 
homogeneous if N= ¥(NN M®), or equivalently, if v= Yv € N where v® e M©, then every 


vO e N. Then N is graded by the submodules N® = N N M®. Moreover, M / N is graded by the 
subgroups (MÔ + N)/ N and (M°+NYN=M°(MON), If M and N are graded R-modules, a 
homomorphism yn of graded modules (of degree 0) is a homomorphism in the usual sense of R- 
modules such that yM® c N® for every i. Then the image 7(M) is a homogeneous submodule 
of N and ker 7 is a homogeneous submodule of M. 

IfM is a graded module for the graded ring R, we can choose a set of generators {u,} for M 
such that every u, is homogeneous (e.g., {u,,} = UM). Let {e,} be a set of elements indexed 
by the same set /= {a} as {u,} and let L be a free R-module with base {e,}. Then we have the 
homomorphism ¢ of L onto M such that e, + u,. Let LY be the set of sums of the elements of the 
form ae, where aY) e RO) and j + deg u, = i. Then it is readily seen that L is graded by the 
L® and it is clear that the epimorphism e is a homomorphism of graded modules. It is easily 


seen also that if N is a graded module and y : M—+*N is a homomorphism (of graded modules), 
then there exists a homomorphism ¢ such that 


L 


is commutative. 

We now suppose that M is a graded module for R = F[x),..., Xml. Let L = Lo be a free 
graded module for which we have an epimorphism € of Lọ onto M and let Ky = ker £. Then Ko 
is graded and hence we can determine a free graded module L, with an epimorphism ¢, of L} 
onto Kp. Combining with the injection of Kp into Lọ we obtain d):L, -— Lo. Again, let K, = ker 
d; and let L, £ be a free graded module and epimorphism of L, onto Kj, d, the composite of 
this with the injection of K, into L,. We continue this process. Then Hilbert’s syzygy theorem 
states that in at most m steps we obtain a kernel K; that is itself free, so that one does not have 
to continue. In a slightly different form we can state 


HILBERT’S SYZYGY THEOREM. Let M be a graded module for R = F[x),..., Xml, the 


ring of polynomials in m indeterminates with coefficients in afield F Let be an exact 
sequence of homomorphisms of graded modules such that every L; is free. Then K,, is free. 


(rN a ae ae ee S A 


The proof will be based on the result that h.dim F < m and two further results that we shall 
now derive. As before, F is regarded as an R-module in which JF = 0, where J is the ideal of 


polynomials vanishing at (0,..., 0). Since J = X; > ọ RË, F becomes a graded R-module if we 
put F® = F, F) =0 for i > 1. We now prove 


LEMMA 5. 7/M is a graded R-module and M ®;pF = 0, then M = 0. 


Proof. We have the exact sequence 0 — J — F — 0. This gives the exact sequence 
MrJ >MOrR=>M®:F=0., Since M@e8=™ and M @pF = 0, by hypothesis, we have the 
exactness of M @p J — M — 0. The map here sends an element Xu; @ f; into X fiu; so the 
exactness means that every element of M can be written in the form X f;u; where the u; e M and 
the f; e J. It follows that M = 0. For, otherwise, let u be a non-zero of M of minimum degree ( 
= highest degree of the homogeneous parts). Then u = X fiu; f, u; E€ M, f; and u; homogeneous. 
Since the degrees of the f; are > 0 we have a u; # O with deg u; < deg u, contrary to the choice 
ofu. Hence M=0. O 


The key result for the proof of Hilbert’s theorem is 


THEOREM 6.21. IfM is a graded module such that Tor,*(M, F) = 0, then M is free. 


Proof We can regard M ® pF as vector space over F by restricting the action of R to F. Then if 


ape F and u e M, *4®f) = © = «Sh. Tt follows that the elements of the formu ® 1, u € M, u 
homogeneous, span M @;F as vector space over F and so we can choose a base for M ® pF of 


the form {u;@ 1 | u; E M}. Let L be a free R-module with base {b;}. We shall prove that M is 
free with base {u;} by showing that the homomorphism 7 of L into M such that yb; = u; is an 
isomorphism. Let C be the cokernel of 7. Then we have the exact sequence L+M=C +0 and 
hence we have the exact sequence LOF MSF ™= COF +0, The map 7 ® 1 sends b; ® 1 into u; 
® | and since {u; @ 1} is a base for M ®pF, y ® 1 is an isomorphism. Hence C @pF = 0 and 
so, by Lemma 5, C= 0. This means that 7 is surjective and LM —0 is exact. Let K now denote 
ker 7. Then we have the exact sequence 0 — K — L — M — 0, which gives the exact 
sequence Tor *(M, F) > K RF — L @pF *®. M @pF. By hypothesis, Tor *(M, F) = 0, and 
we have seen that 7 ® 1 is an isomorphism. Hence K ® p F = 0 and so, again by Lemma 5, K = 
0. Then, ker 7 = 0 and 7 is an isomorphism. O 


Since any projective module satisfies the hypothesis Tor;*(M, F) = 0 (Theorem 6.10, p. 
354), we have the following consequence of Theorem 6.21. 
COROLLARY. Any projective R-module for R = F{x,..., X„] that is graded free. 


We can now give the 


Proof of Hilberfs syzygy theorem. The argument used in the proof of the implication (2) =>(3) 
in Theorem 6.18, p. 376, shows that Tor,*(K,,, F) = Tor®,(M, F). Since hdim F < m, 
Tori. i(M.F)= 0 by one of the remarks following Theorem 6.18. Hence Tor;*{K,,, F) = 0 and so 
K,, 1s free by Theorem 6.21. O 


EXERCISES 


1. Let K be a commutative ring of finite global dimension m and let K[x] be the polynomial 
ring in an indeterminate x over K. Let M be a K[x]-module and 7 = K[X] ®@,M. Note that 


any element of 7 can be written in one and only one way in the form 18u +x8u, + @u + 

Note that there is a K[x]-homomorphism 7 of s# onto M such that f(x) @u ~ f(x)u for fs) 
e K[x], u e M. Show that N = ker yn is the set of elements (x ® ug — 1 @ xu) 
+(x? @ uy —1@ x?uy) + HOO u11 O x*uy_4)4 and that the map 
1@ lo +X @ uy +? Buz +--+ (x @ uol O xu) +(x? Om-Om) 1s a K[x]-isomorphism, so we have 
an exact sequence of K[x|-homomorphisms 0+-“+M+m-o0 By exercise /, p. 278, 
h.dimg;,) = h.dimy M < m. Hence, by Theorem 6.19, h.dimg,,,M@ < m+ 1 and the global 


ea of K[x] < m +1. 


2. Prove that if F is a field and x; are indeterminates, then the global dimension of F[x),..., 
Xn] is m. 
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Commutative Ideal Theory: 
General Theory and Noetherian Rings 


The ideal theory of commutative rings was initiated in Dedekind’s successful “restoration” of 
unique factorization in the rings of algebraic integers of number fields by the introduction of 
ideals. Some of these, e.g., “lv -5] are not factorial, that is, do not have unique factorization of 
elements into irreducible elements (see BAI, pp. 141 — 142). However, Dedekind showed that 
in these rings unique factorization does hold for ideals as products of prime ideals (definition 
on p. 389). A second type of ideal theory, which was introduced at the beginning of this century 
by E. Lasker and F. S. Macaulay, is concerned with the study of ideals in rings of polynomials 
in several indeterminates. This has obvious relevance for algebraic geometry. A principal 
result in the Lasker-Macaulay theory is a decomposition theorem with comparatively weak 
uniqueness properties of ideals in polynomial rings as intersections of so-called primary 
ideals (definition on p. 434). In 1921 Emmy Noether gave an extremely simple derivation of 
these results for arbitrary commutative rings satisfying the ascending chain condition for 
ideals. This paper, which by its effective use of conceptual methods, gave a new direction to 
algebra, has been one of the most influential papers on algebra published during this century. 

In this chapter we shall consider the ideal theory—once called additive ideal theory— 
which is an outgrowth of the Lasker-Macaulay-Noether theory. In recent years the emphasis has 
shifted somewhat away from the use of primary decompositions to other methods, notably, 
localization, the use of the prime spectrum of a ring, and the study of local rings. The main 
motivation has continued to come from applications to algebraic geometry. However, other 
developments, such as the study of algebras over commutative rings, have had their influence, 
and of course, the subject has moved along under its own power. 

We shall consider the Dedekind ideal theory in Chapter 10 after we have developed the 
structure theory of fields and valuation theory, which properly precede the Dedekind theory. 

Throughout this chapter (and the subsequent ones) all rings are commutative unless the 
contrary is explicitly stated. From time to time, mainly in the exercises, applications and 


extension to non-commutative rings will be indicated. The first nine sections are concerned 
with arbitrary commutative rings. The main topics considered here are localization, the method 
of reducing questions on arbitrary rings to local rings via localization with respect to the 
complements of prime ideals, the prime spectrum of a ring, rank of projective modules, and the 
projective class group. The ideal theory of noetherian rings and modules is developed in 
sections 7.10-7.18. Included here are the important examples of noetherian rings: polynomial 
rings and power series rings over noetherian rings. We give also an introduction to affine 
algebraic geometry including the Hilbert Nullstellensatz. Primary decompositions are treated 
in section 7.13. After these we consider some of the basic properties of noetherian rings, 
notably the Krull intersection theorem, the Hilbert function of a graded module, dimension 
theory, and the Krull principal ideal theorem. We conclude the chapter with a section on /-adic 
topologies and completions. 


7.1 PRIME IDEALS. NIL RADICAL 


We recall that an element p of a domain D is called a prime if p is not a unit and if pļab in D 
implies pla or p|b in D. This suggests the following 


DEFINITION 7.1. An ideal P in a (commutative) ring R is called prime if P + R and if ab 
e P for a,b e R implies eitheraeé Por be P. 


In other words, an ideal P is prime if and only if the complementary set P’ = R — P is closed 
under multiplication and contains 1, that is, P’ is a submonoid of the multiplicative monoid of 
R. In congruence notation the second condition in Definition 7.1 is that if ab = 0(mod P), then a 
= (0(mod P) or b = 0(mod P). The first condition is that the ring R = R/P + 0. Hence it is clear 
that an ideal P is prime if and only if R/P is a domain. Since an ideal M in a commutative ring 
R is maximal if and only if R/M is a field and since any field is a domain, it is clear that any 
maximal ideal of R is prime. It is clear also that an element p is prime in the usual sense if and 
only if the principal ideal (p)( = pR) is a prime ideal. Another thing worth noting is that if P is 
a prime ideal in R and A and B are ideals in R such that AB c P, then either A c PB CP. If 
not, then we have a € A, ¢ P and be B, € É P. Then ab € AB c P, contrary to the primeness 
of P. It is clear by induction that if P is a prime ideal and a, a)... a, E P, then some a; € P 


and if A, A,-A,, C P for ideals A;, then some A; c P. 


We recall the elementary result in group theory that a group cannot be a union of two proper 
subgroups (exercise 14, p. 36 of BAI). This can be strengthened to the following statement: If 
G,, Gz, and H are subgroups of a group G and H c (G) U G3), then either H c G, or H c G. 


The following result is a useful extension of this to prime ideals in a ring. 


PROPOSITION 7.1. Let A, 1,, ..., 1, be ideals in a ring such that at most two of the I; are 
not prime. Suppose A c U”, I. Then A CI; for some j. 


Proof. We use induction on n. The result is clear if n = 1. Hence we assume n > 1. Then if 
we have Ac/,U Uk, U S UT, for some k, the result will follow by the induction 


hypothesis. We therefore assume 4 éh vvuhu:=ul for k= 1, 2,...,n and we shall complete 
the proof by showing that this leads to a contradiction. Since 4¢ fv vhku- vi. , there exists 
an acA gh Ue UU Te k= 1,2,...., Since AC Ul, a, € Iņ. Ifn =2, it is readily seen that (as in 


the group theory argument) a, + a) € A but a, + a € J, U h, contrary to hypothesis. If n > 2, 
then at least one of the J; is prime. We may assume it is 74. Then it is readily seen that 


Ai +.0203°"* An 


is in A but is not in UZ. Again we have a contradiction, which proves the result. O 


The foregoing result is usually stated with the stronger hypothesis that every J; is prime. The 
stronger form that we have proved is due to N. McCoy, who strengthened the result still further 
by replacing the hypothesis that A is an ideal by the condition that A is a subrng (BAI, p. 155) 
of the ring R. It is clear that the foregoing proof is valid in this case also. In the sequel we shall 
refer to Proposition 7.1 as the “prime avoidance lemma.” The terminology is justified since the 
contrapositive form of the proposition is that if A and J}, ..., Z, are ideals such that A + I, for 


any j and at most two of the J; are not prime, then there exists ana € A such that a$ UI F 


There is an important way of obtaining prime ideals from submonoids of the multiplicative 
monoid of R. This is based on 


PROPOSITION 7.2. Let S be a submonoid of the multiplicative monoid of R and let P be 
an ideal in R such that (1) P N S = Ø. (2) P is maximal with respect to property (1) in the 
sense that if P' is any ideal such that P' È P, then P' N S + Ø. Then P is prime. 


Proof. Leta and b be elements of R such that a É P and b € P. Then the ideals (a) + P and 
(b) + P properly contain P and so meet S. Hence we have elements pj, p> € P, x1, X2 E R, S1, S2 


€ S such that s4 = xja + py, Sy) = xX2b + po. Then s; sy E S and 
SS = X,X,0b+X, ap, +X bp; +P;P>- 


Hence if ab € P, then s,s, € P, contrary to P N S= Ø. Thus ab € P and we have shown that a 
€P,bEP implies ab É P, so P is prime. O 


If S is a submonoid not containing 0, then the ideal 0 satisfies 0 N S = Ø. Now let A be any 
ideal such that A N S= Ø and let A be the set of ideals B of R such that B > A and B N S= Ø. It 
is an immediate consequence of Zorn’s lemma that A contains maximal elements. Such an 
element is an ideal P > A satisfying the hypotheses of Proposition 7.2. Hence the following 
result follows from this proposition. 


PROPOSITION 7.3. Let S be a submonoid of the multiplicative monoid of R and A an 
ideal in R such that S N A = Ø. Then A can be imbedded in a prime ideal P such that S N P = 
Ø. 


Let N denote the set of nilpotent elements of R. Evidently if z € N so z” = 0 for some n and 
if a is any element of R, then (az)” = a”z” = 0. Hence az e N. Ifz; € N, i= 1, 2, and n; is an 
integer such that = = 0, then by the binomial theorem for n = n; + n, — 1 we have (z; + Zz)” = 
X227", This is 0, since ifi <n, then n — i > m, so 3'1 '=0 and if i > ny, then clearly == = 0 
. Thus N is an ideal. We call this ideal the nil radical of the ring R and we shall denote it as 
nilrad R. If =z + Nis in the nil radical of R = R/N, N = nilrad R, then z” € N for some n and 
so z”™” = 0 for some integer mn. Then z € Nand £ = 0. Thus nilrad R = 0. We recall that any nil 
ideal of a ring is contained in the (Jacobson) radical rad R (p. 192). We recall also that rad R 
is the intersection of the maximal left ideals of R (p. 193), so for a commutative ring rad R is 
the intersection of the maximal ideals of the ring. The analogous result for the nil radical is the 
following 


THEOREM 7.1 (Krull). The nil radical of R is the intersection of the prime ideals of R. 


Proof. Let N= nilrad R and let N'=[ \P where the intersection is taken over all of the prime 
ideals P of R. Ifz e N, we have z” = 0 for some integer n. Then z” € P for any prime ideal P 
and hence z € P. Hence N c P for every prime ideal P, so N c N'. Now let s É N, so s is not 
nilpotent and S = {s"\” = 0, 1, 2, ...} is a submonoid of the multiplicative monoid of R 
satisfying S N {0} = Ø. Then by Proposition 7.3 applied to A = 0 there exists a prime ideal P 
such that P N S= Ø. Then s É P and s € N'. This implies that N' c N and so N= N' =[(\P. O 


If A is an ideal in R, we define the nil radical of A, nilrad A (sometimes denoted as v^), to 
be the set of elements of R that are nilpotent modulo A in the sense that there exists an integer n 
such that z” € A. This is just the set of elements z such that # = z + A is in the nil radical of R = 
R/A. Thus 


(1) nilrad A = y~! (nilrad R) 


where v is the canonical homomorphism of R onto R. It follows from this (or it can be seen 
directly) that nilrad A is an ideal of R containing A. It is clear also that iteration of the process 
of forming the nil radical of an ideal gives nothing new : nilrad (nilrad A) = nilrad A. 

We have the bijective map B~ B = B/A of the set of ideals of R containing A onto the set of 
ideals of R = R/A. Moreover, R/B = R/B (BAI, p. 107). Hence B is prime in R if and only if B is 
prime in R. Thus the set of prime ideals of R is the set of ideals P = P/A where P is a prime 
ideal of R containing A. Since | | (P/A) = (/ \P)/A = (BAL, p. 67, exercise 2), and | P taken over 
the prime ideals of R is the nil radical of R, we have 


((\P)/A = (\(P/A) = (\P =nilrad R 
= nilrad R/A. 


Hence nilrad A =! |P taken over the prime ideals P of R containing A. We state this as 


(2) 


THEOREM 7.2. The nil radical of an ideal A is the intersection of the prime ideals of R 
containing A. 


EXERCISES 


1. Let A;, 1 <i <n, be ideals, P a prime ideal. Show that if P >[ V; A; then P > 4; for some 
i and if P=! \", A, then P = A; for some i. 


2. Show that if P is a prime ideal, then S = R — P is a submonoid of the multiplicative 
monoid of R, which is saturated in the sense that it contains the divisors of every s € S. 
Show more generally that if {P,} is a set of prime ideals, then S = R — U P, is a 


saturated submonoid of the multiplicative monoid of R. Show that conversely any 
saturated submonoid of the multiplicative monoid of R has the form R — U P, {P,,} a set 


of prime ideals of R. 
3. Show that the set of zero divisors of R is a union of prime ideals. 


4. (McCoy.) Show that the units of the polynomial ring R[x], x an indeterminate, are the 
polynomials dg + a, x + + a,x" where dg is a unit in R and every a; i > 0, is nilpotent. 
(Hint: Consider the case in which R is a domain first. Deduce the general case from this 
by using Krull’s theorem.) 


The next two exercises are designed to prove an important result on the radical of a 
polynomial ring due to S. Amitsur. In these exercises R need not be commutative. 


5. Let f(x) € R[x] have 0 constant term and suppose f(x) is quasi-regular with quasi-inverse 
g(x) (p. 194). Show that the coefficients of g(x) are contained in the subring generated 
by the coefficients of f(x). 


6. (Amitsur.) Show that R[x] is semi-primitive if R has no nil ideals + 0. (Hint: Assume that 
rad R[x] # 0 and choose an element + 0 in this ideal with the minimum number of non- 
zero coefficients. Show that these coefficients are contained in a commutative subring B 
of R. Apply exercises 4 and 5.) 


7.2 LOCALIZATION OF RINGS 


The tool of localization that we shall now introduce is one of the most effective ones in 
commutative algebra. It amounts to a generalization of the familiar construction of the field of 
fractions of a domain (BAI, pp. 115-119). We can view this generalization from the point of 
view of a universal construction that is a solution of the following problem: 

Given a (commutative) ring R and a subset S of R, to construct a ring R, and a 
homomorphism 4, of R into R, such that every 1,(s), s € S, is invertible in R,, and the pair (R,, 
4,) is universal for such pairs in the sense that if 7 is any homomorphism of R into a ring R’ 
such that every 7(s) is invertible, then there exists a unique homomorphism 7 : R, — R' such 
that the diagram 


is commutative. 
Before proceeding to the solution of the problem (which is easy), we shall make some 
remarks about the problem. 


1. Since the product of two elements of a ring is invertible if and only if the elements are 
invertible, there is no loss in generality in assuming that S is a submonoid of the multiplicative 
monoid of R. The solution of the problem for an arbitrary set S can be reduced to the case in 
which S is a monoid by replacing the set S by the submonoid <$) generated by S. For example, 
if Sis a singleton {s}, then we can replace it by <> = {s”"|n = 0, 1, 2, ...}. 

2. The special case of the field of fractions is that in which R is a domain and S = R*, the 
submonoid of non-zero elements of R. In this case nothing is changed if we restrict the rings R’ 
in the statement of the problem to be fields. This is clear since the image under a 
homomorphism of a field either is the trivial ring consisting of one element or is a field. 

3. Since a zero divisor of a ring ( # {0}) is not invertible, we cannot expect A, to be 
injective if S contains zero divisors. 

4. If the elements of S are invertible in R, then there is nothing to do: We can simply take Rs 
= R and A, = lp, the identity map on R. Then it is clear that 7 = 7 satisfies the condition in the 
problem. 

5. If a solution exists, it is unique in the strong sense that if (Rs, As) and (Rs, As) 


satisfy the condition for the ring R and subset S, then there exists a unique isomorphism ¢ : Ro? 


— Rs such that 


is commutative (see p. 44). 


We now proceed to a construction of a pair (R, 4,) for any ring R and submonoid S of its 


multiplicative monoid. As in the special case of a domain R and its monoid S = R* of non-zero 
elements, we commence with the product set R x S of pairs (a, s), ae R, s € S. Since the 
monoid S may contain zero divisors, it is necessary to modify somewhat the definition of 
equivalence among the pairs (a, s) that we used in the construction of the field of fractions of a 
domain. We define a binary relation ~ on R x S by declaring that (a), s1) ~ (az, $2) if there 


exists ans € S such that 
(4) S($,4, —S,a,) = 0 


(Or ss24] = ss4a2). If R is a domain and S does not contain 0, then (4) can be replaced by the 
simpler condition sa, = sa. In the general case it is readily verified that ~ is an equivalence 
relation. We denote the quotient set defined by this equivalence relation as R, and we denote 
the equivalence class of (a, s) by a/s. 

We define addition and multiplication in the set R, by 


(5) A;/8, +a,/s, = (S241 +5,a)/s)5; 
and 
(6) (a,/81) (a3/s2) = ajaz/SiSż 


It is a bit tedious but straightforward to check that these compositions are well-defined and that 
if we put 0 = 0/1 (= 0/s for any s € S) and 1 = 1/1 ( = s/s, s € S), then (Ry, +, -, 0, 1) is a ring. 
We leave the verifications to the reader. 

We now define a map åș (or 4?) of R into Ry by 


(7) As:a~a/l. 


It is clear from (5) and (6) and the definition of 1 in Rg that A, is a homomorphism of R into Rs. 
Moreover, ifs € S, then A.(S) = s/1 has the inverse 1/s since (s/1) (1/s) = s/s = 1. Now let 7 be 
a homomorphism of R into R’ such that 7(s) is invertible for every s e S. It is readily verified 


that 


(8) Ñ :a/s > n(a)n(s) 


is well-defined and this is a homomorphism of Rg into R'. Moreover, Asla) =° (a/l = n(a). 
Thus we have the commutativity of (3). The uniqueness of " is clear also since a/s = (a/1) 
(1/s) = (a/1) (s/1)- ' and hence any ï satisfying the commutativity condition satisfies i(a/s) =" 
(A(a)A,(s)') = n(a)n(s) ~ |. Thus "i is the map we defined in (8) and (R,, 4,) is a solution of 
the problem we formulated at the outset. We shall call (R,, 2,) (or simply R,) the localization 
of RatS. 


We shall now study relations between R, and Ry for S’, a submonoid of the monoid S. Note 
first that A(s’) is invertible in R, s’ € S". Hence we have a unique homomorphism Cy: 5 : Ry > 
R, such that 2, = Cy. s A. Now suppose that 4,(s) is invertible in Ry, s € S. Then we also have 
a unique homomorphism ¢,, s’: R, > Ry such that 4y. = ¢, y» As. It follows that Cs y Css = Ips, 
and Cs ss, s' = lp. Hence Cy. 5 and ¢s s are isomorphisms. We shall now show that in general 
if S' c S, then R, is, in fact, a localization of Ry. We state the result in a somewhat imprecise 
manner as follows. 


PROPOSITION 7.4. Let S' be a submonoid of S and let S/S' = {s/s'|\s € S, s'e S'}. Then S/S 
‘is a submonoid of the multiplicative monoid of Ry and we have canonical isomorphisms of 


R, and (Ry); (sy and of (Rey); (sy and (Ry) si 


Proof. We shall obtain the first isomorphism by showing that (Rg, Va gS) has the universal 
property of R,. First, we have the composite homomorphism 4, xs) Ay, of R into (Rg); 5) 
obtained from the sequence of homomorphism 


As’ ^y (S) 
R—— Ry —> (Rs Jiss} 


Now let 7 be a homomorphism of R into R' such that 7(s) is invertible, s € S. Since S' c S, we 
have a unique homomorphism” of Ry into R' such that %4, = 7. Ifs € S, then 7'(Ag(s)) = y(s) is 
invertible in R’. Accordingly, by the universal property of (R,)A,(S) we have a unique 


homomorphism " of (R,’);.(s) into R’ such that 743s) =". Then "0s = 7s =", Moreover, ” is 
the only homomorphism of (Ry);.(s) into R' satisfying "“s.«ss)=" For, this condition implies 
that ("4.948 = 14s, which implies that %9 = by the universality of (Rs-4s). Then we obtain the 
uniqueness of " by the universality of (&s):is»4i0), Thus Rs)» “0s! has the universal 


property of (Ry, A, ) and so we have the required isomorphism. 
The isomorphism of (Ry), (5) and (Ry)sj can be seen by observing that A,(S) is a 


submonoid of S/S’. Hence we have the canonical homomorphism of (Rg)iss) into (Rs) sis. If s 
e S, s'e S', then s/s’ = (s/1)(s‘/l) | in Ry and ĉi) and 4098) are invertible in (Rg Asis): 


Hence 2:..s(s/s)) is invertible. It follows from the result we obtained above that the canonical 
homomorphism of (Ry); (s) into (Ry),j" 18 an isomorphism. O 


EXERCISES 


1. Let S and T be submonoids of the multiplicative monoid of R. Note that ST = {sts € S, t € 
T} is the submonoid generated by S and T. Show that Ro = (R,)A,AT). 


2. Leta, b e R. Show that Rehan = Re , 
3. Let S be a submonoid of the multiplicative monoid of R. Ifa, b € S, define < =< if aļb in 
S. In this case there is a unique homomorphism =e% of ®«~*» such that 4» A 


Show that “*=""*" (with respect to the ¢ ’s). 


7.3 LOCALIZATION OF MODULES 


It is important to extend the concept of localization to R-modules. Let M be a module over R, S$ 
a submonoid of the multiplicative monoid of R. We shall construct an R,-module M, in a 


manner similar to the construction of R,. We consider M x S the product set of pairs (x, s), xe 
M, s € S, and we introduce a relation ~ in this set by (x), s1) ~ (%, Sz) if there exists ans € S 
such that 


The same calculations as in the ring case show that ~ is an equivalence. Let M, denote the 
quotient set and let x/s be the equivalence class of (x, s). We can make M, into an R,-module 
by defining addition by 

(10) X 1/8; +X2/S_ = (S2X1 +S1X2)/8182 


and the action of R, on M, by 


(11) (ajs) (x/t) = ax/st. 


We can verify as in the ring case that (10) and (11) are well-defined, that + and 0 = 0/s 
constitute an abelian group, and that (11) defines a module action of R, on M,. We shall call the 


R -module M, the localization of M at S or the S-localization of M. 
Although we are generally interested in M, as R,-module, we can also regard M, as R- 


module by defining the action of R by a(x/s) = (a/1) (x/s) = ax/s. Since a ~ a/l is a ring 
homomorphism, it is clear that this is a module action. We have a map 4, (or 2,” if it is 
necessary to indicate M) of M into M, defined by x ~ x/1. This is an R-module homomorphism. 
The kernel of 1, is the set of x e M for which there exists an s € S such that sx = 0, that is, if 
ann x denotes the annihilator ideal in R of x, then ann x N S # Ø. It is clear that 2, need not be 


injective. For example, if Mis a Z-module and S = Z — {0}, then the torsion submodule of M is 
mapped into 0 by A,. It is clear that if S includes 0, then M, = 0 and ker A, = M. Another useful 


remark is 


PROPOSITION 7.5. IfM is a finitely generated module, then M, = 0 if and only if there 
exists an s € S such that sM =Q. 


Proof. IfsM =0, every x/t = 0 = 0/1 since s(lx) = s(t0) = 0. Conversely, suppose M, = 0 and 
let {x}, ..., X„} be a set of generators for M. Then x,/1 = 0 implies there exists an s; € S such 
that sx; = 0. Then sx; = 0 for s = [| J”; s; ; hence sx = 0 for any È rx, r;e R Thus sM=0. O 


Let f: M — N be a homomorphism of .R-modules of M into N. Then we have a 
corresponding R -homomorphism f, of M, into N, defined by 


(12) fst) = f (x)/t. 


Again we leave the verifications to the reader. The maps M ~ M,, f ~ f, define a functor, the S- 
localization functor, from the category of R-modules to the category of R,-modules. We shall 
now show that this functor is naturally isomorphic to the functor R, ®p from R-mod to R,-mod. 
Thus we have 


PROPOSITION 7.6. For every R-module M we can define an R,-isomorphism ny of M, 
onto R,® pM that is natural in M. 


Proof. We show first that there is a map yy of M, into R,® p M such that 


(13) x/s ~ (1/s)@x. 


Suppose x,/s = X>/s>, which means that we have ans € S such that ssx; = ss ,x>. Then 


(I/s, \@x, = (SS3/55251 \@x, = SS>( | [S8281 )\@x, 


(1/s595,)@ss 2X, = (1/ss25;)®@ss 1X2 


SSi (1/ss55,)@Xq = (SS1/S825,)@X2 


= (1/s:)® X2. 


Hence (13) is well-defined. Direct verification shows that n} is a group homomorphism. We 
note next that we have a well-defined map of the product set R, x M into M, such that 


(14) (a/s,x) ~ ax/s. 


To check this we have to show that if a,/s; = a/s, in R, then a,x/s) = dox/s>. Now if a/s} = 
a,/s,, then we have an s € S such that sas) = saps}. Then sa,s.x = sas,;x, which implies the 
required equality a,x/s, = a>x/sy. Direct verification shows that (14) satisfies the condition for 
a balanced product of the .R-modules R, and M. Hence we have a group homomorphism 7’ jy 
of R,® pM into M, sending (a/s)®x into ax/s. Following this with yọ we obtain (1/s)®ax = 
(a/s)®x. On the other hand, if we apply 7, to x/s, we obtain (I/s)® x and the application of n'm 
to this gives x/s. It follows yyn'y is the identity map on R,® pM and y'm is the identity on 
M,. Hence ny is a group isomorphism and so is n'm = 7” tų. Direct verification using the 
definitions shows that 7,, and hence is an R,-map, hence an R,-isomorphism. It remains to 
show the naturality, that is, the commutativity of 


Ms Mu Rz © _M 
Ís 1Of 
Ng Te Ry ® pN 


for a given R-homomorphism f : M — N. This follows from the calculation 
(1@f Muls) = 1f) ((1/s)@x) = (1/s) @f (x), 
nwfs(x/s) = ny(f (x)/s) = (1/s)@f (x), 


which completes the proof. O 


This result gives rise to a useful interplay between tensor products and localization. We 


recall first that the functor N® p is right exact, that is, exactness of M’4 M & M” 0 implies that 
of N @xM'—N OMN OM” 79, Applying this with N = R, in conjunction with Proposition 
7.6 shows that Ms“ Ms # Mj +0 is exact. Next we prove directly that if 9M’ 4 M is exact, then 
0— Ms Ms is exact. Suppose f,(x' /s) = 0, so f(x’)/s = 0. Then we have a t e S such that f(x’) = 
0. Then f(tx’) = 0 and since f is injective, tx’ = 0 and hence x'/s = 0. Hence ker f, = 0 and so 


, fs . ee 
0=>Ms>Ms is exact. We can now apply Proposition 7.6 to conclude that 


a! 
0—> Rs @ eM’ "Rs @eM is exact. We recall that this is the definition of flatness for R, as R- 


module (p. 153). Hence we have the important 


PROPOSITION 7.7. R, is a flat R-module. 


We remark also that we have shown that if 0 — M' — M — M" — 01s an exact sequence 
of R-modules, then 0 — M', — M, — M", — 0 is an exact sequence of R,-modules. Thus the 
S-localization functor from R-mod to R,-mod is exact. 

Now let N be a submodule of M. Then the exact sequence 0+ N  M where / is the injection 
gives the exact sequence 9— Ns > Ms and the definition shows that Z, is the injection of N,. We 
have the exact sequence 9+ N >M > M/N +0 where v is the canonical homomorphism. Hence 
we have the exact sequence 9 Ns ® Ms +(M/N)s +0, which shows that M/N, = (M/N),. More 


precisely, the foregoing exactness shows that the map 


(15) x/S+t+N 5 > (x+N)/s 


is an isomorphism of M/N, with (M//N),. 
If M and N are R-modules and f is a homomorphism f : M — N, then we have the exact 
sequence 


0+ kerf>M4N > cokerf> 0 
where coker f= M/f (M). Hence we have the exact sequence 


(16) 0> (kerf) 3 Ms 8 Ns 3 (coker f), > 0. 


This implies that (ker f), is ker fg. Also (coker f); = (VAM) )s = N4AM)s = Ns/fs(M) = coker 
fs- 

We recall next that we have an R,-isomorphism of Rs® x(M® kN) onto (Rs® rM) rIRs®N) 
sending ! ®alv® ry) into (1@ xx)@x,(1@ ay) (exercise 13, p. 148). Hence, by Proposition 7.6, we 
have an R,-isomorphism of (M® RN), onto M,® g N, such that 


(17) (xry) > (x/1)8r, 9/1). 


Now let M be an R,-module. Then M becomes an R-module by defining ax for a € R to be 
(a/l)x. Itis clear that if fis a homomorphism of M as R,-module, then f is an R-homomorphism. 
Now consider M, where M is regarded as an R-module. Since s/1 is invertible in R, for s € S, 
sx = (s/1)x = 0 for x e M implies x = 0. Hence the homomorphism 4, : x ~ x/1 of M into M, is a 
monomorphism. Since x/s = (1/s)(x/1), 4, is surjective. Thus we can identify M as R -module 
with the S-localization M, of M as R-module. 


7.4 LOCALIZATION AT THE COMPLEMENT OF A PRIME IDEAL. 
LOCAL-GLOBAL RELATIONS 


We recall that an ideal P in R is prime if and only if the complement, R — P of P in R, is a 
submonoid of the multiplicative monoid of R. Of particular importance are localizations with 
respect to such monoids. We shall usually write Mp for Mp_p, fp for fp _ p, etc. and call Mp the 


localization of M at the prime ideal P. 

We consider first a correspondence between the ideals of R and Rp and we shall begin by 
considering more generally the ideals in R and in R, for any submonoid S. Let A’ be an ideal in 
R,. Then 


(18) j(A’) = {ae R|a/se A’ for some seS} 


is an ideal in R. Clearly (j(4’)), = A’. Again, if we begin with an ideal A in R, then it is easily 
seen that A, = R, if and only if A contains an element of S. For this reason it is natural to 
confine our attention to the ideals A of R that do not meet S. 

We observe next that if P is a prime ideal of R such that P N S = Ø, then j(Ps) = P. For, let 
a € j(Ps). Then we have an element p € P and elements s, t € S such that a/s = p/t. Hence we 


have a u € S such that uat = ups € P. Since ute S and S N P = Ø, this implies that a € P. 
Hence j(Ps;) c P. Since the reverse inclusion j(A,) > A holds for any ideal A, we have j(Ps;) = 
P. It is straightforward to verify two further facts: If P is a prime ideal of R such that P N S = 
Ø, then P, is a prime ideal in R, and if P' is a prime ideal in R,, then j(P’) is prime in R that 
does not meet S. We leave this to the reader. Putting together these results we obtain 


PROPOSITION 7.8. The map P ~ P, is bijective and order-preserving from the set of 
prime ideals of R, which do not meet S with the set of prime ideals of R,. The inverse map is 
P' ~ j(P’) (defined by (18)). 


We now consider the important case of localization at a prime ideal P. Since Q N (R — P) = 
Ø means Q c P, the foregoing result specializes to 


PROPOSITION 7.9. The map Q ~ Qp is a bijective order-preserving map of the set of 
prime ideals Q contained in P with the set of prime ideals of Rp. The inverse map is P' ~ j(P 


n. 


It is clear that P, contains every prime ideal of Rp. It is clear also that the elements not in 
P, are units in Rp and since Rp # Pp, no element of P, is a unit. Thus P, is the set of non-units 
of Rp and hence Rp is a local ring with rad Rp = P, as its only maximal ideal. We repeat the 
statement of this result as 


PROPOSITION 7.10. Rp is a local ring with rad Rp = P, as its only maximal ideal. 


This fact accounts for the central importance of localization : It often permits a reduction of 
questions on commutative rings and modules over such rings to the case of local rings, since in 
many important instances a result will be valid for R if it holds for every Rp, P a prime ideal in 


R. The following result gives some basic properties of modules that hold if and only if they 
hold at all the localizations at prime ideals. 


PROPOSITION 7.11. (1) Let M be an R-module. If M = 0, then M, = 0 for every 
localization and if Mp = 0 for every maximal P, then M = 0. (2) If M and N are R-modules 
and f is a homomorphism of M into N, then f injective (surjective) implies that f, is injective 
(surjective) for the localizations at every S. On the other hand, if fp is injective (surjective) 
for every maximal ideal P, then fis injective (surjective). (3) If M is flat, then so is every M, 
and if Mp is flat for every maximal ideal P, then M is flat. 


Proof. (1) Evidently M = 0 implies My, = 0. Now assume M + 0 and let x be a non-zero 
element of M. Then ann x # R, so this ideal can be imbedded in a maximal ideal P of R. We 
claim that x/1 # 0 in Mp, so Mp £ 0. This is clear since x/l is the image of x under the 
canonical homomorphism of M into Mp and if x/l = 0, then ann x N (R — P) + Ø. Since ann x c 
P, this is ruled out. 

(2) Let f': M — N. Then we have seen that ker f, = (ker f), and coker f, = (coker f), for any 
submonoid S of the multiplicative monoid of R. Since a homomorphism is injective 
(surjective) if and only if its kernel (cokernel) is 0, (2) is an immediate consequence of (1). 

(3) Suppose that M is a flat R-module and 0 N'N is an exact sequence of Ry-modules. 
Regarding N and N as R-modules, we see that 
0— M @,N' > M @,Nisexact. Then 0— (M@,N’)s head (IM®RN)s is exact, and by (16), 


ve l i fy > > ë . . . 
0—> Ms 8r, Ns — Ms@®r.Ns is exact. We have seen that N’, and N, can be identified with N’ and N 


„ 18 , 
respectively. Hence V7 M®r N —>Ms®aN ig exact and so Mg is Rg-flat. Now suppose that 


M is an R-module such that Mp is R,,-flat for every maximal ideal of P of R. Let 0+ N' 4, N be 


. „ 18 a . . p? ? 

an exact sequence of R-modules and consider M®rN'——M®:N. Since Mp is flat and 9> N> Nr 
ah p? 1@/, A Š r? LEGIT f ë è . 

is exact, 0° Mr 8r, Np —Mr®r Nr is exact. Then 0~(M®rN')—(M®:N)r’ is exact. Since this 


= lor = . 
holds for every maximal ?: 9 M@,xN'—+ M@xN is exact by (2). Hence M is flat. O 
y y 


EXERCISES 


1. Show that the nil radical of Rg is (nilrad R)ş. 


2. Show that if P is a prime ideal of R, then Rp/Pp is isomorphic to the field of fractions of 
the domain R/P. 


3. Show that if R is a factorial domain (BAI, p. 141) and S is a submonoid of the 
multiplicative monoid of R not containing 0, then Rg is a factorial domain. 


4. Let {P| 1 <i <n} be a set of prime ideals in R and let S =| F(R —P;). Show that any 
prime ideal of Ry has the form P,; where P is a prime ideal contained in one of the P;. 


7.5 PRIME SPECTRUM OF A COMMUTATIVE RING 


Let R be a commutative ring and let X = X(R) denote the set of prime ideals of R. There is a 
natural way of introducing a topology on the set X that permits the introduction of geometric 
ideas in the study of the ring R. We proceed to define this topology. 

If A is any subset of R, we let V(A) be the subset of X consisting of the prime ideals P 
containing A. Evidently A) = V(/(A)) where /(A) is the ideal generated by A and since the 
nilradical of an ideal Z is the intersection of the prime ideals containing Z, it is clear that VJ) = 
V(nilrad J). Also if P is a prime ideal containing J; /, for J,, J ideals, then either P > J, or PD 
D. Hence VI, D) = VU) U L). We can now verify that the sets V/A), A a subset of R, satisfy 


the axioms for closed sets in a topological space: 
(1) Ø and X are closed sets, since Ø = V({1}) and X= V({0}). 


(2) The intersection of any set of closed sets is closed, since if {A,} is a set of subsets of R, 
then 


N V(A,) = V(L(A,)). 


(3) The union of two closed sets is closed, since we can take these to be V(/,) and VU), J; 
ideals, and then (4) U Vh) = VU, D). 


We shall call X equipped with this topology the prime spectrum of R and denote it as Spec 
R. The subset Xmax of X consisting of the maximal ideals of R with the induced topology is 


called the maximum spectrum. This will be denoted as Maxspec R. Such topologies were first 
introduced by M. H. Stone for Boolean rings and were considered by the present author for the 
primitive ideals of an arbitrary ring. In the case of commutative rings the topology is called the 
Zariski topology of X. 

The open sets in X= Spec R are the complements X- V(A) = X—! | ae A Vas) =U ae WX 
— V({a})). We denote the set X— V({a}) as X, for a € R. This is just the set of prime ideals P 


not containing a, that is, the P such that a =a + P + 0 in R/P. Since any open set is a union of 
sets X,,, these open subsets of X constitute a base for the open sets in Spec R. It is worthwhile 


to list the following properties of the map a ~ X, of R into the set of open subsets of Spec R: 


(2) X, =Ø if and only if a is nilpotent (since nilrad R is the intersection of the prime ideals 
of R). 


(3) X, =X ifand only ifa is a unit in R. 


If Y is a subset of X, put Ay =!\p~ y P. This is an ideal in R and (Ay) is a closed set 
containing Y. On the other hand, if V(A) > Y, then P > A for every P € Yso AyD A and V(A) > 
V(A,). Thus v(Ay) is the closure of Y, that is, the smallest closed set in X containing Y. In 


particular, we see that the closure of a point P is the set of prime ideals containing P. If R is a 
domain, 0 is a prime ideal in R and hence the closure of 0 is the whole space X. 


EXAMPLES 


1. R=2Z. As we have noted, the closure of the prime ideal 0 is the whole space X(2). Hence 
Spec Z is not a 7)-space (a space in which points are closed sets). Now consider Maxspec Z. 
The maximal ideals of Z are the prime ideals (p) # 0. Hence the closure of (p) £ 0 is (p) and so 
Maxspec Z is a T;-space. Let Y be an infinite set of primes (p) + 0 in Z. Evidently | ae p= 
0, so V(Ay) = X. Thus the closure of any infinite subset of Maxspec Z is the whole space. 


Hence the closed sets of Maxspace Z are the finite subsets (including Ø) and the whole space. 
Evidently the Hausdorff separation axiom fails in Maxspec Z. 


2. R = Fx, ..., x], F a field, x; indeterminates. For r = 1, the discussion of Spec and 
Maxspec is similar to that of the ring Z. For arbitrary r we remark that F[x), ..., x,.]/(x,) = Fx, 
..., X], So the prime ideals of F[x,, ..., x,] containing (x,) are in 1—1 correspondence with the 


prime ideals of F[x,, ..., x,]. Hence the closure of (x,) in Spec F[x,, ..., X] is in 1-1 
correspondence with Spec F[Xx5, ..., x]. 


We shall now derive some of the basic properties of the prime spectrum. We prove first 


PROPOSITION 7.12. Spec R is guasi-compact. 


Proof. This means that if we have a set of open subsets O, such that UO, = X, then there 
exists a finite subset O,,, ..., O, of the O, such that UO, = X. (We are following current 
usage that reserves “compact” for “quasi-compact Hausdorff.”) Since the sets X, form a base, 
it suffices to show that if Uae A X, = X, then UX, = X for some finite subset {a;} of A. The 
condition Uae A X, = X gives X — V(A) = X and V(A) = Ø. Then V(/(A)) = Ø for the ideal (4) 
generated by A and so /(A) = R. Hence there exist a; € A, x; € R such that }”;ax;= 1. Retracing 
the steps, we see that V({a;}) = Ø and UX, =X, B 


Let N=nilrad R, R = R/N, and let v be the canonical homomorphism a — a + N of R onto R 
. Any prime ideal P of R contains N and v(P) is a prime ideal of R. Moreover, every prime 
ideal of R has the form v(P), P a prime ideal of R. We have 


PROPOSITION 7.13. The map P ~ v(P) is a homeomorphism of Spec R onto Spec R. 


Proof. The map is injective, since P > N for every prime ideal P and we have seen that the 
map is surjective. Now if a €e R, v(a) € v(P) if and only if a € P. This implies that if A is a 
subset of R, then the image of the closed set V(A) in Spec R is V(v(A)) in Spec R and if A is a 
subset of R, then the inverse image of the closed set V(A) is Viv" '(A)). Hence P ~ v(P) is a 
homeomorphism. O 


We recall that a space X is disconnected if it contains an open and closed subset # Ø, #.X. 
We shall show that Spec R is disconnected if and only if R contains an idempotent + 0, 1. This 
will follow from a considerably stronger result, which gives a bijection of the set of 
idempotents of R and the set of open and closed subsets of Spec R. To obtain this we shall 
need the following result on lifting of idempotents, which is of independent interest. 


PROPOSITION 7.14. Let R be a ring that is not necessarily commutative, N a nil ideal in 
R, and #=u + N an idempotent element of R = R/N. Then there exists an idempotent e in R 
such that è =. Moreover, e is unique if R is commutative. 


Proof. We have u? — u =z where z is nilpotent, say, z” = 0 and z € N. Then (u(1— u))” = uv" 
= 0 where v = 1 — u. From u + v = 1 we obtain 


fe] = (uto) oe etf 


where e is the sum of the terms u'y?” ~ !-' in which n < i < 2n— 1 and fis the sum of the terms 
uiy?” —!—1 in which 0 <i<n-— 1. Since u”v” = 0, any term in e annihilates any term in f. Hence 
ef = 0 = fe. Since e + f= 1, this gives e? = e, f° = f. Every term in e except u?” ~ | contains the 
factor uv =— z. Hence u? t= 


n- l= e (mod N). Since u = u? = w = =u" | (mod N), we have e 
= u (mod N). This proves the first assertion. 


Now assume that R is commutative. The uniqueness of e will follow if we can show that if 
e is an idempotent, then the only idempotent of the form e + z, z nilpotent, is e. The condition (e 
+z)? =e +z gives (1 — 2e)z = z*. Then z? = (1 — 2e)z* = (1 — 2e)*z and by induction we have 
(1 —2e)"z =z""!, Since (1 — 2e} = 1 — 4e + 4e = 1, this implies that z = 0 and hence e + z = e. 
o 


Ife and fare idempotents in R, then so are e'= 1 — e, ef, and eo f=1-(1-e(l-f)=e+f 
— ef. It is readily verified that the set E of idempotents of R is a Boolean algebra with the 
compositions e A f= ef and e v f= e o f (exercise 1, p. 479 of BAT). We remark next that the 
open and closed subsets of a topological space X constitute a subalgebra of the Boolean 
algebra of subsets of X. We can now prove 


THEOREM 7.3. Ife is an idempotent in R, then X, is an open and closed subspace of Spec 
R and the map e ~ X, is an isomorphism of the Boolean algebra E onto the Boolean algebra 
of open and closed subsets of Spec R. 


Proof. Let e =e? e R. Then e(1 — e) = 0, so any prime ideal P of R contains one of the 
elements e, 1 — e but not both. Hence XY, U X,_, =X and X, N Xi - e = Ø, so X, is open and 


closed. Now let Y be an open and closed subset of X, Y = X— Y. Let R = R/N, N = nilrad R, v 
the canonical homomorphism of R onto R. We use the homeomorphism P ~ v(P) of X = Spec R 
with x = Spec R to conclude that v(Y) is an open and closed subset of x and v(Y’) is its 
complement. Consider Ay) and Ag as defined on p. 404. These are ideals in R and VAr) 


= WY), Ayy) = x(Y) since v(Y) and v(Y') are open and closed in x. If P is a prime ideal of R 
containing Ag) + Agn» then P D Apo) and P D Ayyy, so PE V(Y) N (Y). Since v(¥) N (Y) = 
Ø, there are no such P and so Ayy + Ayy) = R. If P is any prime in R, either P > Ayy or PD 
Avy, 80 P D Ayy) N Ayyy Since this holds for all P and R has no nilpotent elements # 0, Ayy 
N Ag) = 0. Hence = AmS», Let ë be the unit of Ay) and let e be the idempotent in R 


such that v(e) = ë. Now if P is an ideal in R, the condition P ə e is equivalent to v(P) ə ë, which 
in turn is equivalent to v(P) > Ayy) and to v(P) € v(Y) and Pe Y'. Hence X, = Y, which 


shows that the map e ~ X, is surjective. To see that the map is injective, let e be a given 
idempotent in R, ë = v(e). Then we have R = Rè®R(1-ë) and Re and R(T — è) are ideals. Now 
v(X,) is the set of prime ideals of x containing! — e, hence x(!— e), and v(Xj _ ,) is the set of 


prime ideals containing Re. We have shown that 

R = Any Any,» Since Auxa > R(T-ë) and Avs,» >Re and R=Re@RI-2, we have 

Anxa = R(l-@). Since 1 — e is the only idempotent in the coset T — ë, by Proposition 7.14, this 

implies the injectivity of e ~ X,. If e and fare idempotents, then X, = X, N Xp Also Xi -e =X 

— Xo X1 = X, Xo = Ø. These relations imply that e ~ X, is an isomorphism of Boolean algebras. 
o 


Evidently we have the following consequence of the theorem. 


COROLLARY. Spec R is connected if and only if R contains no idempotent # 0, 1. 


Because of this result, a commutative ring R is called connected if the only idempotents in 
R are 0 and 1. 


Let f be a homomorphism of R into a second ring R’. If P' is a prime ideal in R’, then P = f~ 
'(P') is a prime ideal in R since if ab € P, then f(a)f(b) = Kab) € P'. Thus either f(a) or Kb) € 
P' and hence either a or b e P. This permits us to define a map 


Pa TP) 


of X = Spec R' into X= Spec R. Ifa € R, then a prime ideal P' of R' does not contain a’ =f (a) 
if and only if P = f*(P’) does not contain a. Hence 


(f*)"(Xq) = Xray 


This implies that the inverse image of any open subset of Spec R under /* is open in Spec R’, 
so f* is a continuous map of Spec R’ into Spec R. It is clear that if R’ = R and f= Lp, then f* = 
lspec R and if g is homomorphism of R' into R” then (fg)* = g*f*. Thus the pair of maps R ~ 
Spec R, f ~ f* define a contravariant functor from the category of commutative rings 


(homomorphisms as morphisms) into the category of topological spaces (continuous maps as 
morphisms). 


EXERCISES 


1. Let f be a homomorphism of R into R’, f* the corresponding continuous map of Spec R’ 
into Spec R. Show that if fis surjective, then f*(Spec R’) is the closed subset V(ker f) of 
Spec R. Show also that f* is a homeomorphism of Spec R’ with the closed set V(ker f). 


2. Same notations as exercise 1. Show that /*(Spec R’) is dense in Spec R if and only if 
nilrad R > ker f. Note that this holds if fis injective. 


3. Give an example of a homomorrphism f of R into R' such that for some maximal ideal M' 
of R', f- ! (M’) is not maximal in R. 


4. (D. Lazard.) Show that if P is a prime ideal in R and Z is the ideal in R generated by the 
idempotents of R contained in P, then R/I contains no idempotents # 0, 1. Show that the 
set of prime ideals P’ > / is the connected component of Spec R containing P (the largest 
connected subset containing P). Hint: Let ü be an idempotent of R = R/I where ñ= u + J. 
Then u(1 — u) € I c P and we may assume u € P. Show that there exists an f € J such 
that f? = fand f(u — u?) = u — u. Then (1 — fu = g is an idempotent contained in P, so g 
e I. Thenu=g+fue Jand#=0.) 

In exercises 5 and 6, R need not be commutative. 


5. Let N be a nil ideal in R and let ñ, ..., &, be orthogonal idempotent elements of 
R=R/N (ij =i, ğü=0 if i#j), Show that there exist orthogonal idempotents e; in R such 


that ë; = ñ, 1 <i < n. Show also that if X} "ñ, = 1, then necessarily Ye; = 1 


6. Let R, N, R be as in exercise 5 and let ü; , 1 < ij <n, be elements of R such that 
Üjüu = Oaity, Ly = l, Show that there exist œe; € R such that 


y 


Cu = One ey = 1,6, = ty, 1 Sijs n 


P 4 


7.6 INTEGRAL DEPENDENCE 


In BAI, pp. 278-281, we introduced the concept of R-integrality of an element of a field E for 
a subring R of E. The concept and elementary results derived in BAI can be extended to the 
general case in which F is an arbitrary commutative ring. We have the following 


DEFINITION 7.2. Jf E is a commutative ring and R is a subring, then an element u € E is 
called R-integral if there exists a monic polynomial f (x) € R[x], x an indeterminate, such 


that f (u) = 0. 


If f(x) = x"— an- 1x"7'— +" ao a€R, then we have the relation u” =ayg+a,ut+-"+a,_,u"~ 


from which we deduce that if M = R1 + Ru + + Ru"~', then uM c M. Evidently M is a 
finitely generated R-submodule of E containing 1. Since uM c M, M is an R[u]-submodule of 
E and since 1 € M, M is faithful as R[u]-module. Hence we have the implication 1 = 2 in the 
following. 


LEMMA. The following conditions on an element u € E are equivalent: 1. u is R-integral. 
2. There exists a faithful R{u|-submodule of E that is finitely generated as R-module. 


Proof. Now assume 2. Then we have u;e M such that M = Ru, + Ru, + + + Ru,. Then we 


have the relations wu; =)"; = | aju;, 1 < i < n, where the a; € R. 


Hence we have the relations 


(U—a,,)U, —Qy,U,—*** —a,,u, = 0 


— Azu; + (u— az) — *** —a>,U, = 0 


— ğu — hiy — A (Ukat = 9. 


If we multiply the ith of these equations by the cofactor of the (i, j)-entry of the matrix u1 — (a,j) 
and add the resulting equations, we obtain the equation f(u)u; = 0, 1 <j < n, where f(x) is the 
characteristic polynomial of the matrix (a;). Since the u; generate M and M is faithful as R[w]- 
module, we have f(u) = 0, so u is a root of the monic polynomial f(x). Hence2+1. O 


We remark that the argument just used is slightly different from the one given in the field 
case in BAI, p. 279. If M and N are R-submodules of E that are finitely generated over R then 
so is MN = {)'m,n; \m; € M,n,;€ N}. Moreover, if 1 € Mand 1 € Nthenl € MN and if either 


uM c M or uN c N then uMN c MN. These observations and the foregoing lemma can be used 
in exactly the same way as in BAI, pp. 279-280, to prove 


THEOREM 7.4. If E is a commutative ring and R is a subring, the subset R' of elements of 
E that are R-integral is a subring containing R. Moreover, any element of E that is R'- 
integral is R-integral and hence is contained in R'. 


The subring R’ of E is called the integral closure of R in E. If R' = E, that is, every element 
of E is integral over R, then we say that £ is integral over R (or E is an integral extension of 
R). 

If A is an ideal of a ring £, then evidently R N A is an ideal in the subring R of E. We call 
this the contraction of A to R and denote it as A°. We also have the subring (R + A)/A of E/A 
and the canonical isomorphism of this ring with the ring R/AS = R/(R N A). One usually 
identifies (R + A)/A with R/A° by means of the canonical isomorphism and so regards E/A as an 
extension of R/A®. In this sense we have 


PROPOSITION 7.15. Jf E is integral over R and A is an ideal in E, then E = E/A is 
integral over R = R/AS. 


Proof. Let#=u+ Abe an element of E. Then we have a; € R,O0<i<n-—1, such that u” = 
agot aju + + ap qU? |. Then # = 4o+a,t+ + +4,-,H""" where a; = a; + A. Hence ñ is integral 
over (R + .A)/A, hence over R = R/AS. O 


Next suppose that S is a submonoid of the multiplicative monoid of R. Then the localization 


E, contains the localization R, as a subring. Moreover, we have 


PROPOSITION 7.16. Let R be a subring of E, R' the integral closure of R in E, S a 
submonoid of the multiplicative monoid of R. Then R's is the integral closure of R, in E,. 


Proof. Any element of R', has the form u/s, u € R', s e S. We have a relation u” = ag t a; u + 
= +a,_4,u"—!, a, € R. Hence (u/s)" = ao/s" + (a1/s"'Xu/s)+ + + (dn 1/su/sY""', Hence u/s is 
integral over R,. Conversely, suppose that u/s is integral over R, where u € E, s e S. To show 


that u/s = v/t where v e R', t €e S, it suffices to show that there exists ans’ € S such that us’ €e R 
’. For, u/s = us'/ss' has the required form. Now u/S integral over R, implies that u/1 is integral 
over R,, since s/l e R, and u/1 = (u/s)(s/1). Thus we have a relation of the form 
(u/1)" = do/so + (41/81 )(u/1)+ + +(an-1/Sn-1)(u/1)"7* with a; € R, s; e S. Multiplication by 
ti/1 where t; = | |6 * s: gives (t,u/1)" = ao/l +(a4,/1)(t,u/1)+ = + (an-1/D(t1u/1 7" where the a’ € R. Hence 
[(t yu)" — ao — ailtiu)— -+ — an- altiu)" "V1 = Then there exists a t, € S such that ta[(tiu)"— ao =] = 0, 


Multiplication of this relation by £”; ! shows that s' u € R' for s' =t; t. O 


We prove next 


PROPOSITION 7.17. Jf E is a domain that is integral over the subdomain R, then E is a 
field if and only if R is afield. 


Proof. Assume first that R is a field and let u + 0 be an element of E. We have a relation u” + 
ayu”! + +a, =0 witha; € R and since u is not a zero divisor in E, we may assume a, # 0. 
Then a~ |,, exists in R and we have “(u""' +aiu""* +-+ + an- 1)(— as *)= 1, Hence u is invertible and 
E is a field. Conversely, suppose that Æ is a field and let a 4 0 be in R. Then a ! exists in E 


and we have a relation (@ ‘tala 'f' +- +a = 0 with aeR Multiplication by a” ~ | gives 
a~' = = (aitaa + + and" NER. Hence a is invertible in R and R is a field. O 


An immediate consequence of this result and of Proposition 7.17 is the 


COROLLARY 1. Let E be a commutative ring, R a subring such that E is integral over R, 


and let P be a prime ideal in E. Then P° = P N R is maximal in R if and only if P is maximal 
in E. 


Proof. E = E/P is a domain and by Proposition 7.15, E is integral over R = R/P°. By 
Proposition 7.17, E is a field if and only if R is a field. Hence P is maximal in Æ if and only if 
P° is maximal in R. O 


We also have the following 


COROLLARY 2. Let E and R be as in Corollary | and suppose P, and P, are ideals in R 
such that P4 ? P3. Then PS4 ? Ph. 


Proof. We have P*, > P°,. Now suppose p = P*, = P°,. Consider the localization E£, for S = 
R — p. By Proposition 7.16, Ey is integral over Ry. Since P; N R =p, P; N S = Ø. Then, by 
Proposition 7.8, Pis ? P55. On the other hand P;; > pg which is the maximal ideal of the local 
ring Ry. Since Ey is integral over Ry it follows from Corollary 1 that P;; is maximal in £5. This 


contradicts Pig ? Pas. Thus PS4 ? Pe. O 


If P is a prime ideal in Æ, then it is clear that P° is a prime ideal in R. Hence we have a map 
P ~ P© of Spec E into Spec R. This is surjective since we have the following 


THEOREM 7.5 (“LYING-OVER” THEOREM). Let E be a commutative ring, R a 


subring such that E is integral over R. Then any prime ideal p of R is the contraction P® of a 
prime ideal P of E. 


Proof. We assume first that R is local and p is the maximal ideal of R. Let Pbe a maximal 
ideal in E. Then P° is a maximal ideal in R, by the above corollary. Since p is the only 
maximal ideal in R, we have p = P*. 

Now let R be arbitrary and consider the localizations R, and E,,. R, is a local ring and E,, is 
integral over R, (Proposition 7.16). Now there exists a prime ideal P' in £, whose contraction 
P’°= P' N R, = pp, the maximal ideal of R,. By Proposition 7.8 (p. 401), P' = P, for a prime 
ideal P of E such that P N (R — p) = Ø or, equivalently, P R c p. Again, by Proposition 7.8, 
P=j(P,) = {ue Elu/s € P,} for some s € R- p. Now p C j(p,) Cj(P,) =P. Hence P N R= 
p. O 


EXERCISES 


1. Let E be a commutative ring, R a subring such that (1) E is integral over R and (2) E is 
finitely generated as R-algebra. Show that F is finitely generated as R-module. 


2. (“Going-up” theorem). Let F be a commutative ring, R a subring such that F is integral 
over R. Let p; and p, be prime ideals of R such that p, > p, and let P, be a prime ideal 
of E such that P°, = p>. Show that there exists a prime ideal P, of E such that P; > P, and 


PY = pp 


3. Let R be a subring of a commutative ring £, R' the integral closure of R in Æ, J an ideal in 
R, I = IR' its extension to an ideal in R’. An element a € E is integral over I if it 
satisfies an equation f(a) = 0 where f(A) =1"+ b,A"—-!+ = + b, b; € I. The subset 7' of 
E of elements integral over Z is called the integral closure of I in E. 

Show that 7' = nilrad 7° in R'. Sketch of proof: Ifa € E and a"+b,a™!+- +b, =0 
with the b; e J then a €e R' and a” e I°. Hence a e nilrad X° in R'. Conversely, let ae 
nilrad Z° in R'. Thena €e R' and a” = $” a,b; for some m > 0, a; € R', b; € I. Let A be the 
R-subalgebra of E generated by the a;. By exercise 1, A has a finite set of generators ay, 
++) Ag, q Z n, as R-module. Then 44) = Li-ibat%, 1 Sj<4 where the bi, € I. As in the proof 
of the lemma on p. 408, this implies that a” and hence a is integral over Z. 

Remarks, (i) I' is a subring of E (in the sense of BAI, that is, /' is a subgroup of the 
additive group closed under multiplication), (ii) If R is integrally closed in E (R’ = R) 
then 7' = nilrad Jin R. 

4. (Basic facts on contractions and extensions of ideals.) Let R be a subring of a 
commutative ring E. For any ideal J of E, © =I N R is an ideal in R and for any ideal i of 
R, i? = iE is an ideal in E. Note that 


LoLehol 
i Pie ihm h 
= 4, Pei. 
Hence conclude that 

pee = iv, 7 = I‘ 


for any ideals i and 7. Note that these imply that an ideal i of R is the contraction of an 
ideal of E « i =i and an ideal J of E is an extension of an ideal of R « I = °° 


7.7 INTEGRALLY CLOSED DOMAINS 


An important property of domains that we shall encounter especially in the study of Dedekind 
domains (see Chapter 10) is given in the following 


DEFINITION 7.3. A domain D is called integrally closed if it is integrally closed in its 
field of fractions. 


It is readily seen that Z is integrally closed. More generally any factorial domain is 
integrally closed (exercise 1, below). Our main objective in this brief section is to prove a 


“going-down” theorem (Theorem 7.6) for integrally closed domains that will be required later 
(in the proof of Theorem 8.37). For the proof of this theorem we require 


PROPOSITION 7.18. Let D be an integrally closed subdomain of a domain E, F the field 
offractions of D, I an ideal in D. Let a € E be integral over I. Then a is algebraic over F 
and its minimum polynomial MÀ) = 2" — ba"! += + (—1)"bm over F has its coefficients (— 
1yb, € nilrad 1,1 <j <m. 


Proof. The first statement is clear. For the second let S be a splitting field over F of m(A) 
and let m(A) = (A — a) (À — am) in S[A] where a, = a. For any i we have an automorphism of 
S/F such that a ~ a;. Since this stabilizes / it follows that every a; is integral over 7. Since b, 1 
<j <m, is an elementary symmetric polynomial in the a; it follows from Remark (i) following 
exercise 3 of section 7.6 that b, is integral over /. Then, by Remark (ii), b, € nilrad I. O 


We shall need also the following criterion that an ideal be a contraction of a prime ideal. 


PROPOSITION 7.19. Let R be a subring of a commutative ring E, p a prime ideal of R. 
Then p = P = P N R for a prime ideal P of E œ pë = Ep N R =p. 


Proof.  Ifp = P then p® = p by exercise 4 of section 7.6. Now assume this holds for a prime 
ideal p of R. Consider the submonoid S = R — p of the multiplicative monoid of E. Since p® = 
p, Ep N R =p and hence Ep N S= Ø. Since Ep is an ideal of E and Ep N S = Ø, the extension 
(Ep), is a proper ideal of the localization E, of E relative to the monoid S. Then (Ep), is 


contained in a maximal ideal Q of E, and P = j(Q) as defined by (18) is a prime ideal of R 
such that P N (R - p) = P N S= Ø. Then P N R cp. Since P = (0) > j((Ep);) > p we have P° 
=PNR=p. D 


We can now prove 


THEOREM 7.6 (“Going-down” theorem). Let D be an integrally closed subdomain of a 
domain E that is integral over D. Let p, and p, be prime ideals of D such that p; > p, and 


suppose P, is a prime ideal of E such that P°, = p,. Then there exists a prime ideal P, of E 
such that P° = p, and P, C P}. 


Proof. Consider the localization £, (= Ez — p,). It suffices to show that pEp N D = pp. For, 
if this holds, then by Proposition 7.19 there exists a prime ideal Q of E Pi such that D N Q = pz 


Then P, = j(Q) is a prime ideal of ŒE contained in P, and 
P,AD={(Q)VND=fQNEND=QND= py. 


We now proceed to the proof that p E p, N D =p Leta € poEp. Then a = b/s where b € 
Ep, and s € E -— P, By exercise 3 of section 7.6, b is integral over p, and hence by Proposition 


7.18, the minimum polynomial of b over the field of fractions F of D has the form 
am — bå"! + +> + (—1)"b,, where biep, 1 <i <m. 


Suppose a = b/s € DoEp, N D. Then s = b/a and the minimum polynomial of s over F is 
a" — (b,/a)A" t +++ + (— D"bn/a"), Since s is integral over D, taking Z = D in Proposition 7.18, we 
see that every b;/a' € D. Then (b,/a')a' € p, and b;/a' and a' € D. Now suppose a € py. Then, 
since p, is prime in D, b/a € p>, 1 < i < m. Thus s is integral over p, and hence over p,. Then 
again by exercise 3, s € nilrad p, E c P} contrary to s e E — P, This shows that a € p, so 
P2Ep, N D C po. Since the reverse inequality is clear we have p Ep N D=p,. O 


EXERCISES 


1. Show that any factorial domain is integrally closed. 


2. Let D be a domain, F its field of fractions. Show that if D is integrally closed then D, is 


integrally closed for every submonoid S of the multiplicative monoid of R. On the other 
hand, show that if Dp is integrally closed for every maximal ideal P of D then D is 


integrally closed. (Hint: Use Proposition 7.11 on p. 402.) 


7.8 RANK OF PROJECTIVE MODULES 


We have shown in BAI, p. 171, that if R is a commutative ring and M is an R-module with a 
base of n elements, then any base has cardinality n. Hence the number n, called the rank of M, 
is an invariant. We repeat the argument in a slightly improved form. Let {ejl < i < n} be a base 


for M and let {f| 1 <j < m } be a set of generators. Then we have 
fi= Xi anen e= Li buh = Ley Outy’s which gives Limi baay =p 1S I<", Assume n > m and 
consider the n x n matrices 


43; 432 *"" Ain On Oin 
bz bim 

sa Omi Am2 mn os ' 0 
0 bpi b 


Then we have BA = 1,, the n x n unit matrix. Since R is commutative, this implies AB = 1, 


(BAI, p. 97), which is impossible if n > m. Hence n = m. Thus we see that if M has a base of n 
elements, then any set of generators contains at least n elements. Hence any two bases have the 


same cardinality. The argument shows also that any set of n generators f; = ia; e; 1S j <n, is 
a base, since the argument shows that in this case the matrix A = (a,;) is invertible in M,(R), 
and this implies that the only relation of the form >’cf; = 0 is the one with every c; = 0. We 
summarize these results in 


PROPOSITION 7.20 Let M be a free module over a commutative ring with base of n 
elements. Then (1) any base has cardinality n, (2) any set of generators contains at least n 
elements, and (3) any set of n generators is a base. 


We shall now give a method, based on localization, for extending the concept of rank to 
finitely generated projective modules over commutative rings R. We shall prove first that such 
modules are free if R is local. For this we require an important lemma for arbitrary rings 
known as 


NAKAYAMA’S LEMMA. Let M be a module over a ring R (not necessarily commutative). 
Suppose that (1) M is finitely generated and (2) (rad R)\, = M. Then M = 0. 


Proof. Let m be the smallest integer such that M is generated by m elements x), X,..., Xm If 
M £0, then m > 0. The condition (rad R)M = M implies that x, = rx] + °° + 7), where the r; 
e rad R. Then we have 


(L—T,.)X—_ = rX t thy 1 Xp -4: 


Since r„ € rad R, 1 —r,, is invertible in R, and acting with (1 — r,,)~ | on the foregoing relation 
shows that x,,, can be expressed in terms of x),...,x,,_ 1. Then x},..., Xm 1 generate M, contrary 
to the choice ofm. O 


We recall that if B is an ideal in a ring R and M is an R-module, then BM is a submodule 
and the module M/BM is annihilated by B and so can be regarded in a natural way as R/B- 
module. In particular this holds for B = rad R. In this case we have the following consequence 
of Nakayama’s lemma. 


COROLLARY. Let M be a finitely generated R-module. Then x), ..., X, E M generate M if 
and only if the cosets ¥ = x, + (rad R)M,..., Xm = Xm + (rad R)M generate M/(rad R)M as 
R/rad R module. 


Proof. If x, ... Xm generate M, then evidently %, ..., ¥%,, generate M = M/(rad R)M as R- 


module and, equivalently, as R = R/rad R-module. Conversely, suppose the *, generate M as R- 
module, hence as R-module. Then M = (rad R)M + N where N = } Rx;. Then (rad R)(M/N) = 


MIN and M/N is finitely generated. Hence M/N = 0 by Nakayama’s lemma, so M = N = 
ye Rx;. O 


We can now prove 


THEOREM 7.7. Jf R is a (commutative) local ring, then any finitely generated projective 
module over R is free. 


Proof. We may assume that M is a direct summand of a free module F = R” : F=M @® N 
where M and N are submodules. Then (rad R)F = (rad R)M + (rad R)N and (rad R)M c M, 
(rad R)N C y. Hence F = F/(rad R)F =M ƏN where M = (M + (rad R)F)/(rad R)F, Ñ = (N+ 
(rad R)F)/(rad R)F. Now R = R/rad R is a field and evidently F is an n-dimensional vector 
space over R, and M and Ñ are subspaces. We can choose elements y}, ..., y,, SO that y; ... y, € 
M, y,.4 --- Yn E Nand if ¥;=y,+ (rad R)F, then {¥, ..., ¥,,} is a base for F over R. Then, by 
the Corollary to Nakayama’s lemma, y}, ..., vy, generate F and, by Proposition 7.18, they form a 
base for F. Then {y}; ..., y,} is a base for M so Mis free. O 


We now consider finitely generated projective modules over an arbitrary ring K and we 
prove first 


PROPOSITION 7.21. Jf M is a free R-module of rank n then M, is a free R -module of 


rank n for any submonoid S of the multiplicative monoid of R. If M is projective with n 
generators over R, then M, is projective with n generators over R,. 


Proof. To prove the first statement we recall that S-localization is a functor from R-mod to 
R,-mod. Hence a relation M = M} ® = ®M, for R-modules implies M, = M,;® | ®M,¢ for 
R,-modules. Then M = R® * @R (n copies) implies M, = R,® ©: ®Rg (n copies). Next 
suppose M is projective with n generators. This is equivalent to assuming that M is isomorphic 
to a direct summand of the free module R™. Then M, is R,-projective with n generators. O 


In particular, if M is projective with n generators, then for any prime ideal P, Mp is 
projective with n generators over the local ring Rp. Hence Mp is Rp-free of rank np < n. We 
call np the P-rank of M and we shall say that M has a rank if np = ng for any two prime ideals 
P and Q of R. In this case the common value of the local ranks np is called the rank of the 
finitely generated projective module M. We shall now show that the map P ~ np is a 


continuous map of Spec R into Z endowed with the discrete topology. This will imply that if 
Spec R is connected or, equivalently, R has no idempotents + 0, 1, then M has a rank. 


The continuity we wish to establish will be an easy consequence of 


PROPOSITION 7.22. Let M be a finitely generated projective R-module, P a prime ideal 
in R. Then there exists a € P such that Ma is free as Reay-module. 


Proof. Mp is a free Rp-module of finite rank. If (x,/s), ..., x,/s,), x; E M,s;¢ R-P,isa 
base for Mp, and since the elements s;/1 are units in Rp, (x/1, ..., x,,/l) is also a base. Consider 


the R-homomorphism f of R™® into M such that (a, ...,a„)> Jax; We have the exact sequence 


f \ 
0> kerf R" >M > coker f> 0. 
Localizing at P, we obtain the exact sequence 
lp Fi v} 
0> ker fp > RP > Mp > coker fp > 0. 
Since Mp is free with base (x/l, ..., xX„/1), fp is an isomorphism and hence ker fp = 0, coker f, 
= 0. Since M is finitely generated, so is its homomorphic image coker f and since coker fp, = 


(coker f)p, it follows from Proposition 7.5 that we have an element b É P such that b(coker f) = 
0. Then (coker f),,, = 0 and we have the exact sequence 


>0. 


; Ib) AS 
0— ker fp — RY — Map 


Since Mæ is projective, this splits, so ker fs, is isomorphic to a homomorphic image of R™).,, 
and hence this is finitely generated. Since > is a submonoid of R — P, 0 = (ker f)p = ker f,, is 


obtained by a localization of (ker f),,, (Proposition 7.4). Since (ker f),,, is finitely generated, 
this implies that there exists an element c/1, c É P such that (c/1) (ker fjæs = 0. Then ker fire; = 
0. Hence if we put a = bc, then a É P and we have 


à hay 
v-— RY —M Ea T 0, 


so May = R 4 is free. This complates the proof. O 
We now have 


THEOREM 7.8. The map ry: P ~ np(M) of Spec R into Z (with discrete topology) is 
continuous. 


Proof Let P be any point in Spec R. We have to show that there exists an open set O 
containing P such that no = np for all Q in O. Take O = X, where a is as in Proposition 7.20. 


Then P € X, and if Q € X,, then Mo is a localization of Mca, which is a free Riw)-module of 
rank n. Then the rank of Mọ is n. Thus ng =n = np. O 


If Spec R is connected, the continuity of the map rọ implies that np = ng for all B Q € Spec 


R. Hence we have the 


COROLLARY. Jf R is connected, that is, R has no idempotents + 0, 1, then the rank is 
defined for every finitely generated projective R-module. 


Again let M be a finitely generated projective module and consider the continuous map rwy 
of Spec R into Z. Since Z has the discrete topology, any integer n is an open and closed subset 
of Z and r~} „(n) is an open and closed subset of Spec R. Moreover, Spec R is a disjoint union 
of these sets. Since Spec R is quasi-compact, the number of non-vacuous sets 7! y(n) is finite. 
Thus we have positive integers n, ..., n, such that 7! y(n) # Ø and X= Spec R = Ur! y(n). 
By Theorem 7.3, the open and closed subset 7!,,(n;) has the form_X,, for an idempotent e; € R. 
Since Tain) (\ra'(n) = @ ifi¥j , ee; = 0 for i # j and since Ura'(n) =X. De=1. Thus the e; are 
orthogonal idempotents in R with sum 1 and hence R = R,® + ®Ry where R; = Re; and M = M} 
® - ƏM, where M; = eM = R;M. It is clear from the dual basis lemma (Proposition 3.11, p. 
152) that M; is a finitely generated projective R;-module. We claim that M; has a rank over R; 
and this rank is n;. Let P, be a prime ideal in R;. Then P = Pi+2Xs+:%; is a prime ideal in R not 
containing e;, so P € X, and P contains every e;, j # i. We have Rr = Rir®*** @Rsr and since e; 
€ P and eR; = 0 for j + i, Rp = Rip. Similarly Mp = Mjp. Hence the rank of Mjp over Rip is n; 
We have the homomorphism r ~ re; of R into R; in which the elements of R — P are mapped into 
elements of R--P;. Following this with the canonical homomorphism of R, into Rip, gives a 
homomorphism of R into R,p, in which the elements of R-P are mapped into invertible 
elements. Accordingly, we have a homomorphism of Rp into R;p, and we can use this to regard 
Rip, as Rp-module. Then M;p, = Rip ®R,M;jp, and since Mjp is a free R;p module of rank n;, Mjp; 
is a free Rjp, module of rank n;. Since this holds for every prime ideal P, of R;, we see that the 


rank of M; over R; is n;. A part of what we have proved can be stated as 


THEOREM 7.9. Let M be a finitely generated projective module over a commutative ring 
R. Then there are only a finite number of values for the ranks np(M) for the prime ideal P of 


R. If these values are ny,...,n,, then R= R® ` ®Rg where the R; are ideals such that R;M is 
finitely generated projective over R; of rank n,. 


EXERCISES 


1. Let M be a finitely generated projective module over the commutative ring R and let P be 
a maximal ideal of R. Show that if M has a rank over R, then this is the dimensionality of 
MIPM regarded as vector space over the field R/P. 


2. Let M and R be as in exercise 1 and assume that R is an algebra over R’ such that R is 
finitely generated projective over R'. Suppose that M has a rank over R and R has a rank 
over R’. Show that M has a rank over R' and rank M/R' = (rank M/R) (rank R/R). 


3. Show that if D is a domain, then any finitely generated projective module M over D has a 
rank and this coincides with the dimensionality of F® pM over F where F is the field of 
fractions of D. 


7.9 PROJECTIVE CLASS GROUP 


We now make contact again with the Morita theory. We recall that if R' and R are rings, an R’ - 
R-bimodule M is said to be invertible if there exists an R-R'-bimodule M’ such that M' ® pM = 
R as R-R-bimodule and M® pM'= R' as R'-R' -bimodule. This is the case if and only if M is a 
progenerator of mod-R (R'-mod). Then R' = End Mp as rings and M'= hom (Mp, Rp) as R-R' - 
bimodule. If R' = R, the isomorphism classes of invertible bimodules form a group Pic R in 
which the multiplication is given by tensor products. Now suppose R is commutative. We 
restrict our attention to the R-R-bimodules in which the left action is the same as the right 
action. In effect we are dealing with (left) R-modules. The isomorphism classes of invertible 
modules constitute a subgroup of Pic R that is called the projective class group of the 
commutative ring R. The following result identifies the modules whose isomorphism classes 
constitute the projective class group. 


THEOREM 7.10. Let R be a commutative ring, M an R-module. Then M is invertible if 
and only if it is faithful finitely generated projective and rank M = 1. 


Proof. Suppose that M is invertible and M’ is an R-module such that M®pM'= R. Then M 
and M' are finitely generated projective. Let P be a prime ideal in R. Then M® pM'= R implies 
that Mp® rM ' p = Rp. Since these modules are free of finite rank over the local ring Rp, this 
relation implies that rank Mp/p, = 1. Since this holds for all P, we see that rank M = 1. 
Conversely, suppose that M satisfies the stated conditions and let M* = homp(M,R). Since M is 


faithful finitely generated projective, M is a progenerator by Theorem 3.22. Hence, as in the 
Morita theory, we have the isomorphism u of M® pM* onto EndpM sending x®y*, x € yy, y* € 


M*, into the map y ~ y*(y)x. Since R is commutative, any r €e R determines an R- 
endomorphism y ~ ry. We can identify R with this set of endomorphisms and so Endg M > R. 


Hence to show that M ®p M* = R, it suffices to prove that EndpM = R. This will follow if we 
can show that (EndpM)p = Rp for every prime ideal P. Now (EndrM)p = Rp® pEndpjy Since M 
is finitely generated projective, it is easily seen that Rp® pEndgM = EndR,,M,, (see the proof of 
Proposition 3.14, p. 154). Now if M is a free module of rank n over a commutative ring R, then 
EndpM is a free module of rank n? over R (= M,,(R)). By hypothesis, Mp has rank 1 over Rp. 


Hence Endp,Mp has rank 1 over Rp. Thus (EndgM)p has rank 1 over Rp and hence (EndgM)p = 
Rp for every P. This completes the proof. O 


We shall not give any examples of projective class groups at this point. Later (section 10.6) 
we shall see that if R is an algebraic number field, then this group can be identified with a 
classical group of number theory. 


7.10 NOETHERIAN RINGS 


In the remainder of this chapter we shall be interested primarily in noetherian rings and 
modules. We recall that the noetherian condition that a module satisfy the ascending chain 
condition for submodules is equivalent to the maximum condition that every non-vacuous set of 
submodules contains submodules that are maximal in the set, and equivalent to the “finite basis 
condition” that every submodule has a finite set of generators (p. 103, exercise 1). We recall 
also that a (commutative) ring is noetherian if it is noetherian as a module with respect to 
itself, which means that every properly ascending chain of ideals in the ring terminates, that 
every non-vacuous set of ideals contains maximal ones, and that every ideal has a finite set of 
generators. Moreover, any one of these conditions implies the other two. In 1890 Hilbert based 
a proof of a fundamental theorem in invariant theory on the following theorem : 


HILBERT’S BASIS THEOREM. /f R is a field or the ring of integers, then any ideal in 
the polynomial ring R[X1,X, ..., X,], x; indeterminates, has a finite set of generators. 
Hilbert’s proof admits an immediate extension of his basis theorem to 


HILBERT’S (GENERALIZED) BASIS THEOREM. /f R is a ring such that every ideal 
in R is finitely generated, then every ideal in R[x,,... ,X,] is finitely generated. 


Proof. Using induction it evidently suffices to prove the theorem for the case of one 
indeterminate x. We have to show that any ideal B of R[x] has a finite set of generators. Let j = 
0, 1,2,... and let T I(B) be the set of elements b; € R such that there exists an element of the 


form 
fi=b;xt+aj-;x' + +ageB. 


It is evident that /; is an ideal in R, and since f; € B implies that xf; = b;x/*! +: € B, itis clear 
that J; c J, ; Hence J = U7, is an ideal in R. This has a finite set of generators, and we may 
assume that these are contained in one of the ideals J,. If {b,{)), ... ,b'”,,} are these 
generators, we have polynomials /m’ = bwx" +g% EB where deg ¢,, < m. Similarly, since every 
I, is finitely generated for j < m, we have polynomials f = bix’ +gieB where deg g, < j 
and 1 <i; <k;. We claim that the finite set of polynomials 


FAT) (ko) (ly (ki) (1) (k)) 
dit de rend inca bout 


generate B. Let fe B. We prove by induction on n = deg f that f= Lwhuf!” for suitable h; € 
R[x]. This is clear if f= 0 or deg f= 0, so we assume that f= b,x” + fı where b, # 0 and deg fi 
<n. Thenb, € I, cI. Ifn > m, then b, = Liaibm ae R, and Zax" "fa’EB and has the same leading 
coefficient as f. Hence deg (f — Yax" "®© ) < n. Since this polynomial is in B, the result 
follows by induction. If n < m, we have b, = 5s = Leaibs’, aeR, and deg (f- 5a, fs”) <n, Again the 
result follows by induction. O 


An alternative version of Hilbert’s theorem is that if R is noetherian, then so is the 
polynomial ring R[x ..., x,]. We also have the following stronger result. 


COROLLARY. Let R be noetherian and R' an extension ring of R, which is finitely 
generated (as algebra) over R. Then R'is noetherian. 


Proof. The hypothesis is that R'= R[u};, ..., u,] for certain u € R'. Then R'is a homomorphic 
image of R[x), ..., x,.], x; indeterminates. Since R[x, ..., x,.] is noetherian, so is R’. O 
There is another important class of examples of noetherian rings: rings of formal power 


series over noetherian rings. If R is an arbitrary ring, we can define the ring R[[x]] of formal 
power series over R as the set of unrestricted sequences 


(19) a = (90,41,42. ..), 


a; € R with addition defined component-wise, 0 = (0, 0, ...), 1 = (1, 0,0, ...), and product ab 
for a as above and b = (bọ,b4, ...) defined as c = (co, Cp ...) where 


j+k=i 

It is easily checked that R[[x]] is a ring (exercise 7, p. 127 of BAI). It is clear that R can be 
identified with the subring of R[[x]] of elements (aọ,0, 0, ...) and R[x] with the subring of 
sequences (ao, 41, ..., a,,0, 0, ...), that is, the sequence having only a finite number of non-zero 
terms (BAI, pp. 116—118). 

In dealing with formal power series, the concept of order of a series takes the place of the 
degree of a polynomial. If a = (dp, a, ...), we define the order o(a) by 
(21) (a o if a=0 

oaj = . 
k if ay =" =a,., = 9, a, # 9. 


Then we have 


(22) o(ab) > o(a)+o(b) 

and 

(23) o(a+b) > min (o(a), o(b)). 

If R is a domain, then (22) can be strengthened to o(ab) = o(a) + o(b). In any case if we define 
(24 al = 2-8 


(with the convention that 2~ ° = 0), then we have the following properties of the map a ~ |a| of 
R{[x]] into R: 

(i) la| => 0 and |a| = 0 if and only ifa = 0. 

(ii) ja + b| < max (a|, |b). 

(111)|ab| < Jaļ| |b}. 


The second of these implies that ja + b| < la| + |b|. Hence (i) and (ii) imply that R[[x]] is a 
metric space with distance function d(a, b) =la — b|. We can therefore introduce the standard 
notions of convergence of sequences and series, Cauchy sequences etc. 

We say that the sequence {a}, a © R[[x]], i = 1, 2, 3,..., converges to a € R{[x]] if for 
any real e > 0 there exists an integer N = N(e) such that |a — a| < e for all i > N. Then we write 
lim a = a. The sequence {a} is called a Cauchy sequence if given any ¢ > 0 there exists an 
integer N such that ja — a”| < e for all i,j > N. R[[x]] is complete relative to its metric in the 
sense that every Cauchy sequence of elements in R[[x]] converges. For, let {a} be a Cauchy 
sequence in R[[x]]. Then for any integer n > 0 we have an N such that o(a — a) > n or all i,j 
> N. Let a, be the entry in the (n + 1)-st place of a“). Then every a“), i > N, has this element as 


its (n + 1)-st entry. It is readily seen that if we take a = (aọ,41,42, ...), then lim a® =a, 
We can also define convergence of series in the usual way. We say that a“) + a@+-- =a if 
the sequence of partial sums a“),a + a@,... converges to a. 


If we put x = (0, 1,0,...), then x’ has 1 in the (7 + 1)th place, 0’s elsewhere. It follows that if 
a = (d,4,a>, ...) and we identify aj with (a;,0, 0, ...), then we can write 


(25) lapas) = Ag FOX FOX + ove: 


It is clear that if R is a domain, then so is R[[x]]. It is also easy to determine the units of 
R{[x]], namely, a,x’ is a unit if and only if ay is a unit in R. The condition is clearly necessary. 


To see that it is sufficient, we write )%, a,x’ = ag(1 — z) where z = X°} bx and b, =— ag CA 
Then o(z) > 1 and o(z*) > k. Hence 1 +z +z? + ~ exists. It is readily seen that 


(26) (1—z)(1+z427+°::)=1. 


Hence $”) ax’ = ap(I — z) is a unit with inverse ay ! ( Yo z. It is clear also that the set of 
power series ),b,x! of order > 0 is an ideal in R[[x]]. An immediate consequence of these 
remarks is 


PROPOSITION 7.23. Jf F is a field, then F[[x]] is a local ring. 


Proof. The set of elements Y* a,x’ that are not units is the set for which ay = 0. This is an 
ideal. Thus the non-units constitute an ideal and hence F[[x]] is local. O 


It is easily seen also that if R is local, then so is R[[x]]. We leave the proof to the reader. 
We shall prove next the following important 


THEOREM 7.11. JfR is noetherian, then so is R{[|x]]. 


Proof. The proof is quite similar to the proof of the Hilbert basis theorem. Let B be an ideal. 
For any j = 0, 1,2,... let 7 be the set of b; € R such that there exists an element of = b,x! +g e 
B where o(g;) >j. Then J; is an ideal in R and Jọ C1, C..., sol = Ul; is an ideal in R. Since R 
is noetherian, J has a finite set of generators and all of these are contained in Z, for some m. Let 
these generators be b,,) ..., b,,“. It is clear from (22) and (23) that the set of elements of 
R[[x]] of order > m form an ideal. The intersection of this set with B is an ideal B,, in R[[x]] 
containing the elements /,,, 1 <i < k. Now let fe B,, and suppose f= b,x” + g where b, € R, 
n > mand o(g) > n. Then b, = Ya; for a; € R and 


o| 1- X. ax) >n. 
i 


Iteration of this process yields a sequence of integers n; =n —m < m <n; and elements a, € 
R,1<i<k,j=1, 2,... such that 


k r 
0 [/- by (5 ays") e) >n+n,, 
int \j=1 


r= 1, 2,.... Then a;= $°”, -4 ajx” is well defined and f= Et- 1af». Now consider the ideals 
L, 0 <j <m. Choose a set of generators {b(),,...,b,*9} for J, and Si = by'x) +) EB with olp)’ > j, 


Then, as in the polynomial case, 


{ ¢(1) ‘(ko) (1) *(k)) 
ito goer É geerg lé PAR: S j 


is a set of generators for B. O 


We can iterate the process for forming formal power series to construct R[[x, y]]= (R[[x]]) 


[Lv]], etc. We can also mix this construction with that of forming polynomial extensions. If we 
start with a noetherian R and perform these constructions a finite number of times, we obtain 
noetherian rings. 


Another construction that preserves the noetherian property is described in the following 


THEOREM 7.12. Let R be noetherian and let S be a submonoid of the multiplicative 
monoid ofR. Then the localization R, is noetherian. 


Proof. Let B'be an ideal in R,. As on p. 401 let j(B’) be the set of elements b € R such that 
b/s € B' for some s € S. Then j(B’) is an ideal in R and j(B'), = B' . Since R is noetherian, j(B' 
) has a finite set of generators {b,, ..., bm}. Then the set {b,/1, ..., b,,/1} C= B' and generates 
this ideal. D 


EXERCISES 


1. Show that if J is an ideal in a commutative ring R such that / is not finitely generated and 
every ideal properly containing / is finitely generated, then / is prime. Use this to prove 
that if every prime ideal in R is finitely generated, then R is noetherian. 


2. If R is a ring, we define the ring of formal Laurent series R((x)) over R as the set of 
sequences (a;), — 0 < i < œ, a; E R such that there exists an integer n (depending on the 
sequence) such that a; = 0 for i < n. Define addition and multiplication as for power 
series. Show that R((x)) is a localization of R[[x]] and hence that R((x)) is noetherian if 
R is noetherian. Show that if R is a field, so is R((x)), and R((x)) contains the field R(x) 
of rational expressions in x. 

3. (Emmy Noether’s finiteness theorem on invariants of a finite group.) Let E = F[wy,...,u,)] 
be a finitely generated algebra over the field F and let G be a finite group of 
automorphisms of E/F. Let Inv G = {y € plsy =y, s € G}. Show that Inv G is a finitely 


generated algebra over F. (Sketch of proof. Let f(x) = [le G(x — su) = x" — Pax” = 1 + 
nada, 


Pix , where x is an indeterminate. Then J = F[pq,..- Pmm] C Inv G and E is 


integral over /. Hence, by exercise 1, p. 411, E is a finitely generated /-module. Since / 
is noetherian, Inv G is finitely generated, say by vı, ..., v, Then Inv G = Fp, p ... 


Pmn:Y1s -<-> Velo) 


7.11 COMMUTATIVE ARTINIAN RINGS 


The study of artinian rings constitutes a major part of the structure theory of rings as developed 


in Chapter 4. It is interesting to see how this theory specializes in the case of commutative 
rings, and to consider relations between the artinian and noetherian conditions. Our first result 
in this direction is valid for rings that need not be commutative. 


THEOREM 7.13. Jf R is a ring that is left (right) artinian, then R is left (right) noetherian. 
Moreover, R has only a finite number of maximal ideals. 


Proof. LetJ= rad R, the Jacobson radical of R. By Theorem 4.3 (p. 202), J is nilpotent, so 
we have an integer n such that J” = 0. We have the sequence of ideals RO J> > DJ" = 
0. We regard R as left R-module. Then we have the sequence of R-modules M, = J‘/J'*! where 
J? = R and all of these are annihilated by J, so they may be regarded as modules for the semi- 
primitive artinian ring R = R/J. Since all modules of a semi-primitive artinian ring are 
completely reducible (Theorem 4.4, p. 208), this is the case for the modules M;. Moreover, any 
completely reducible artinian (noetherian) module is noetherian (artinian) and hence has a 
composition series. Accordingly, for each M, we have a sequence of submodules 


M; = Ma > Mi, > °° > Min, > Mim = 9 


l 


such that every M/M; + ı İs irreducible, and these can be regarded as R-modules. 


Corresponding to these we have a sequence of left ideals M} = J > Mp D` DM, p1 = J"! 


such that M;/M; ; +1 is an irreducible R-module. Putting together these sequences we obtain a 
composition series for R as left R-module. The existence of such a series implies that R is left 
noetherian as well as left artinian. The same argument applies to right artinian rings. This 
proves the first statement of the theorem. 

To prove the second, we note that any maximal ideal J of R contains J and T = //J is a 
maximal ideal of the semi-primitive artinian ring R. We have R = R:1®-"®R. where the R, are 
minimal ideals. It follows that the only maximal ideals of R are the ideals 
My = Ri ++ +R;-1+Rjs1 + +R, and that the only maximal ideals of R are the ideals Mahi 


+R, it Rj 4,47 +R, where R; is the ideal in R such that R; = R,/J.0 
For commutative rings we have the following partial converse to Theorem 7.13. 


THEOREM 7.14. IfR is a commutative noetherian ring that has only a finite number of 
prime ideals and all of these are maximal, then R is artinian. 


Proof. Since the Jacobson radical J is the intersection of the maximal ideals of R and the nil 
radical N is the intersection of the prime ideals of R, the hypothesis that the prime ideals are 
maximal implies that J = N. Hence J is a nil ideal and since R is noetherian, J is finitely 
generated. As is easily seen, this implies that J is nilpotent. Since R contains only a finite 
number of maximal ideals, R = R/J is a subdirect product of a finite number of fields. Then R is 
a direct sum of a finite number of fields (Lemma on p. 202) and hence R is artinian. As in the 


proof of Theorem 7.11, we have the chain of ideals RD JD °° > J” 7! J" = 0 and every JJ * 
l is a completely reducible noetherian module for the semi-primitive artinian ring R. Then J‘/.J! 
t1 has a composition series and R has a composition series as R-module. Then R is artinian. 0 


It is easy to determine the structure of commutative artinian rings. This is given in the 
following 


THEOREM 7.15. Let R be a commutative artinian ring. Then R can be written in one and 
only one way as a direct sum R = R; ® R, ® ... ® R, where the R, are artinian and 


noetherian local rings and hence the maximal ideal of R; is nilpotent. Conversely, if R = R} 
® ... ® R, where the R; are noetherian local rings with nilpotent maximal ideals, then R is 


artinian. 


Proof. We base the proof of the first part on the results on modules that we obtained in 
connection with the Krull-Schmidt theorem (pp. 110—115) and the fact that for a commutative 
ring R we have the isomorphism R a EndpR. We have seen that R has a composition series as 


R-module. Hence R = R; ® ... ® R, where the R; are indecomposable R-modules with 
composition series. Hence EndpX; is a local ring. Now R; a Endp,R; = EndpR;, so R; is a local 


ring. Now there is only one decomposition of a ring into indecomposable ideals (p. 204). 
Hence the R; are unique. The artinian property of R carries over to the R;. Hence, by Theorem 


7.13, every R; is also noetherian. It is clear also that the maximal ideal, rad R,, of R; is 
nilpotent. 


To prove the converse, suppose first that R is a commutative local ring whose maximal 
ideal J is nilpotent. Let P be a prime ideal in R. Then P c J and J” c P for some n. Since P is 
prime, this implies that J= P. Hence P is the only prime ideal of R. Since R is noetherian, R is 
artinian by Theorem 7.14. The general case in which R= R, ® ... ® R, follows immediately 


from this special case. O 


If R is an artinian ring and M is a finitely generated R-module, then M has a composition 
series since R is also noetherian and hence M is artinian and noetherian. We can therefore 
define the length (M) of M as the length of any composition series for M. The use of the length 
provides a tool that is often useful in proving results on modules for artinian rings. 


7.12 AFFINE ALGEBRAIC VARIETIES.THE HILBERT 
NULLSTELLENSATZ 


Let F be an algebraically closed field, F) the n-dimensional vector space of n-tuples (4,4, 
... ,), 4; E F, and F[x,,x>, ... ,x,] the ring of polynomials in n indeterminates x; over F. If S 


is a subset of F[x;, ... .x,] we let V(S) denote the set of points (a, ... ,a,) E F(n) such that 
fia), ..., a,) = 0 for every fe S and we call V(S) the (affine algebraic) variety defined by S. 


It is clear that (S) = V1) where J = (S), the ideal generated by S, and also VJ) = V(nilrad J). 
Hence if J; and J, are two ideals such that nilrad 7; = nilrad /,, then K) = VU). A 


fundamental result, due to Hilbert, is tha t conversely if VU,) = VQ) for two ideals in F[x,, 
..., X,], F algebraically closed, then nilrad 7, = nilrad J. There are a number of ways of 


proving this result. In this section we shall give a very natural proof, due to Seidenberg, which 
is based on Krull’s theorem that the nil radical of an ideal is the intersection of the prime 
ideals containing it and on a general theorem of elimination theory that we proved in a 
completely elementary fashion in BAI, pp. 322-325. For convenience we quote the theorem on 
elimination of unknowns: 

Let K =Z or 2/(p), p a prime, and let A = K[t), ..., t,], B = A[x), ..., x, ] where the 7’s and 
x’s are indeterminates. Suppose F, ..., Fn G € B. Then we can determine in a finite number of 


steps a finite collection (hl »--r} where fs = Wnv--imi9 © 4 such that for any extension field F 
of K and any (c, ..., ¢,), C; € F, the system of equations and inequations 


(27) Ellies Kinn Xe 22 FCs ung let Mayenne he) m 


a 
Olly N geasa VY 


is solvable for x’s in some extension field E of F if and only if the c; satisfy one of the systems 
Pe 
J 


(28) Salis 0r) = = fin (Cis) =O, Gliese) #0. 


Moreover, when the conditions are satisfied, then a solution of (27) exists in some algebraic 
extension field E of F. 

For our present purposes the important part of this result is the last statement. This implies 
the following theorem, which perhaps gives the real meaning of the algebraic closedness 
property of a field. 


THEOREM 7.16. Let F be an algebraically closed field and let fi, ..., fi g € FIX, -5 Xn] 
where the x; are indeterminates. Suppose that the system of equations and inequation 

(29) F(X yong X%q) =" fia =D. Gisk FO 

has a solution in some extension field E of F. Then (29) has a solution in F. 


Proof. The field F has one of the rings K = Z or 2/(p) as its prime ring. Now by choosing 
enough additional indeterminates 1, ..., t, we can define polynomials F}, ..., Fp» CG € B as in 


the elimination theorem such that Fc}, ..., Cs Xis «ney Xp) ESX] vie Xp) Gs ws Ch Xj ade 


X,) = g(x, --- Xn). Then it follows from the result quoted that, if (29) has a solution in some 


extension field E of F, then we have a j for which (28) holds, and this in turn implies that (29) 
has a solution in an algebraic extension field of F. Since F is algebraically closed, the only 
algebraic extension field of F is F itself (BAI, p. 216, or p. 460 below). Hence (29) is 
solvable in F. O 


Hilbert’s theorem, the so-called Nullstellensatz, is an immediate consequence of this result 
and the characterization of the nil radical as intersection of prime ideals. 
NULLSTELLENSATZ. Let I be an ideal in the polynomial ring Fx), ..., X,] in n 
indeterminates x; over an algebraically closed field F and let ge F[x, ..., x,,]. Suppose g(a, 
...5 Ay) = 0 for all (ay, ..., a„) on the variety V(I) defined by I. Then g € nilrad I. 


Proof. Suppose g € nilrad J. Then there exists a prime ideal P > J such that g É P. Consider 
the domain D = F[x,, ..., x, ]/P = F[X 1, .... X,] where X ;=x,+ P. Then for f= f(x), ..., Xp) 
e I, fe P and hence A X 4, ..., X ,) = 0. On the other hand, g É P, so (Xi ¥,) #0. Now 
F[x,, ..., X,] is noetherian, so Z has a finite set of generators fj, ..., f,,. Let E be the field of 
fractions of D. Then £ is an extension field of F containing the elements X}, ..., X „ such that 
AXi. X,)=0, 1 <ism,g(X,..., X,) #0. Hence by Theorem 7.14, there exist a, ..., 
a, E€ F such that f(a), ..., a,) = 0, g(a), ..., a,) #0. Since the f; generate Z, we have f(a, ..., 
a,) = 9 for all fe J. This contradicts the hypothesis and proves that g € nilrad 7. D 


Evidently the Nullstellensatz implies that if /; and J, are ideals in F[x), ..., x,,] such that 
VWL) = VU), then J, c nilrad J, so nilrad /,c nilrad (nilrad /,) = nilrad J. By symmetry, 
nilrad J} c milrad J, and so nilrad J, = nilrad /,. An important special case of the 
Nullstellensatz is 


THEOREM 7.17. Jf 1 is a proper ideal in F[x, ..., x,], F algebraically closed, then V(I) + 
Ø. 


Proof. If V(I) = Ø, then g = 1 satisfies the condition of the Nullstellensatz so 1 € nilrad Z and 
hence 1 €e Z, contrary to the hypothesis that Z is proper. 0 


The Nullstellensatz permits us also to determine the maximal ideals in F[x), ..., x,]. If (a), 
vey dy) E€ FO, the ideal My a, = (01 — Gy, «++ Xp — Gq) is maximal in Flxy, ..., x,] since 
FIX, «+5 Xa ]/Ma,, ... a, = F. Evidently VM, a) consists of the single point (a), ..., a,). IFZ 
is any proper ideal, the foregoing result and the Nullstellensatz imply that 7 c nilrad M,, 


an 


for some point (a, ..., a,). Since M4, is maximal, it coincides with its nilradical, so Z c 
ingle 


5a 


Ma ,... a, Hence we see that the only maximal ideals of F[x;, ..., x,,] are those of the form M,, 


„a, Moreover, since VM, a) = (4), -++5 Gn) }, We have the following result. 


COROLLARY. /f F is an algebraically closed field, then the map (a, ..., d,) ~= M, 


alo ++ 


Ay, = (X1 — Ay, <- Xy —,y) is a bijection of F® onto the set of maximal ideals of F[x), ...,X,]- 


We now suppose that F is any infinite field and we consider again F™) and FTx,, ..., Xp], 
and define the variety V(S) for a subset S of F[x,, ..., x,] as in the algebraically closed case. 
Also if V is a variety, we let i(V,) denote the ideal of polynomials fe F[x,, ..., x,], which 
vanish for every (aj, ..., a,) € V}. The following properties follow directly from the 
definition: 

(1) VF Ixy, ..., X,)) = Ø, 
(2) VO) = F”, 
(3) VUS,) = WS,); 
(4) WS) = VV) if = (S), the ideal generated by S, 
(5) WLD) = VU) o VU) for any ideals J, and J, 
(6) WSD D WS) if S1 c S, 
(7) i(Q) = FLX, ..., Xy] 
(8) i(V,) D (V5) 1f V, CVs, 
(9) i(VU,)) > J, for any ideal 4, 
(10) Vi(V,)) > V; for any variety V}. 


Relations (6), (8), (9), and (10) have the immediate consequences that 

(11) EVD) = VW), 

(12) iVE) = iV). 
We now recall an important theorem, proved in BAI on p. 136, that if F is an infinite field and 
f(x}, ..., X,) is a non-zero polynomial with coefficients in F, then there exist a; € F such that 
fia), ..., a,) #9. Evidently this implies 

(13) i(F) =0. 

Relations (1)—(5) show that the set of varieties satisfy the axioms for closed subsets of a 

topological space. The resulting topology is called the Zariski topology of the space F™). The 
open subsets in this topology are the sets F”) — V, V a variety. If V = V(S), then F™® — V = Ore 


S (F™ — V({ft)). Thus the open sets 0; = (F™ — V({ft)) form a base for the open subsets of 
F”), Evidently Oris the set of points (a), ..., a„) such that f(a), ..., dn) # 0. 
We now observe that if F is algebraically closed (hence infinite), then F) with its Zariski 


topology is the same thing as the maximum spectrum of the ring F[x), ..., x,,]. More precisely, 
we have a canonical homeomorphism between these two spaces. This is the map (a), ..., d,) ~ 
M,,, ..., a, given in the corollary above. Since the condition that f(a, ..., a,) = 0 for a 


polynomial f(x, ..., x,) is equivalent to fe Ma, ..., a, the map we have defined induces a 


bijective map of the set of closed sets in F) with the set of closed sets in Maxspec (see p. 
403). Hence we have a homeomorphism. 

When F = R or ©, the Zariski topology on F” has strikingly different properties from the 
usual Euclidean topology. Let us now consider some of these properties for F”), F any infinite 
field. Since any point (a), ..., a„) is the variety defined by the ideal M3 |... a, = (1 — Gy, <--> Xp 
—a,), it is clear that F” is a T,-space. However, it is not a Hausdorff space. On the contrary, 
F”) is irreducible in the sense that any two non-vacuous open subsets of F” meet. In other 
words, any non-vacuous open subset is dense in F”. Since the sets Oy constitute a base, it 
suffices to see that if Or + Ø and Q; + Ø, then Or N Q; + Ø. The theorem on non-vanishing of 
polynomials implies that O; # Ø if and only if f# O. It is clear also that O; N O, = Of. Since 
F(x, ...,X,] is a domain, it follows that O; + Ø, O; + Ø imply Op N O, + Ø. 

We note next that the space F®™” with the Zariski topology is noetherian in the sense that the 
ascending chain condition holds for open subsets of F). Equivalently the descending chain 
condition holds for varieties. Suppose that V} > V) D ... is a descending chain of varieties. 


Then we have the ascending chain of ideals i(V,) c i(V) C ..., so there exists an m such that 
i(V;) = i(V; +1) for all j > m. Since V; = V(i(V;)), we have V; = V; , | for j >m. 
Let us now look at the simplest case: n = 1. Here the variety V({f}) defined by a single 


polynomial f is a finite set f + 0 and is the whole space F if f= 0. Moreover, given any finite 
set {a;| 1 <i<r} we have V({f}) = {a;} for f= [|(x — a;). It follows that the closed sets in the 


Zariski topology are the finite subsets (including the vacuous set) and the whole space. The 
open subsets of F are the vacuous set and the complements of finite sets. A subbase for the 
open sets is provided by the complements of single points, since any non-vacuous open set is a 
finite intersection of these sets. 

It should be observed that the Zariski topology provides more open sets than one gets from 
the product topology obtained by regarding F” as product of n copies of F. For example, the 
open subset of F™” defined by x, + ... + x, # 0 cannot be obtained as a union of open subsets 
of the form O; x 0, x ... x O,, O; open in F. 

Any polynomial f(x), ..., x,,) defines a polynomial function 


(30) (Bi cng leg) ~> f (dy, ..+5Ay) 


of F™ into F. We have considered such functions in BAI, pp. 134-138. The theorem we 
quoted on non-vanishing of polynomials shows that the map sending the polynomial f(x), ..., 


x,) into the function defined by (30) is an isomorphism of F[x,, ..., x,] onto the ring of 
polynomial functions. More generally, consider a second space F”), Then a sequence (fj, ..., 
hin)» Fi =S +++» Xp), defines a polynomial map 


(31) (p++ yn) > (fu (O15 +++5Qy)y -fnll ++ +5 4q)) 


of F”) into F™ Such maps are continuous of F into KF”, both endowed with the Zariski 
topology. To see this we take any neighborhood O, of (f{(a), ..., an), ---» Sin(@1> +++» Gn). Then 
we have a polynomial g(x), ..., Xm) such that g(f,(@), ..., an), --->Si(Q, «+> 4,)) #0. Then A(x}, 
oo Xn) = BAX e Mads ofp +--+» Xp) E O in F[x, ..., Xy] and O, is mapped into O, by 
(31). Thus (31) is continuous. We remark that since non-vacuous open subsets are dense, to 
prove that a polynomial map is 0 (that is, sends every element into 0), it suffices to show this 
for a non-vacuous open subset. 

These rudimentary algebraic geometric ideas are often useful in “purely algebraic” 
situations. Some illustrations of the use of the Zariski topology will be given in the following 
exercises. 


EXERCISES 


In all of these we assume F algebraically closed (hence infinite). The topologies are Zariski. 
1. Leta e M,(F) and let f,(A) = 4” — tr(a) 4” 7 |! + ... +- 1)” det a be the characteristic 


polynomial of a. Show that the maps a ~ tra, ..., a ~ det a are polynomial functions on 
the n?-dimensional vector space. Show that the set of invertible matrices is an open 
subset of M,(F). 


2. Let {p}, ...,, } be the characteristic roots of a (in some order) and let g(x), ...,x,) be a 
symmetric polynomial in the x; with coefficients in F. Show that the mapa ~ g(p, ..., 
Pn) is a polynomial function on M,(F). Show that the set of matrices, which are similar 
to diagonal matrices with distinct diagonal entries, is an open subset of M,(F). 


3. Let f, g e F[x), ..., x,] and suppose that g(a), ..., a„) = O for every (a, ..., a,,) such that 
Jia, ..., ap) = 0. Show that every prime factor of fis a factor of g. 


4. Ifa e M,(F), let U, denote the linear transformation y ~ aya of M,(F) into itself. Let 
d(a) = det U,. Use the Hilbert Nullstellensatz and Theorem 7.2 of BAI, p. 418, to prove 
that d(a) = (det a)”. 

5. Give an alternative proof of the result in exercise 4 by noting that it suffices to prove the 
relation for a in the open subset of matrices that are similar to diagonal matrices with 
non-zero diagonal entries. Calculate d(a) and de a for a diagonal matrix. 

6. Show that the nil radical of any ideal in F[x), ..., x„] is the intersection of the maximal 
ideals containing it. 


7. Let V be a variety in F), A polynomial function on V is defined to be a map p|V where p 
is a polynomial function on F), These form an algebra A(V) over F under the usual 
compositions of functions. Show that A(V) a F[x,, ...,x,|/i(V) = FIX i... Xp, X¥; =x; 


+ i(V). The latter algebra over F is called the coordinate algebra of the variety V. V is 
called irreducible if i(V) is prime. In this case the elements of the field of fractions of 
the coordinate algebra are called rational functions on V. If V' is a variety in F”, a 
map of V into Vis called regular if it has the form g|V where g is a polynomial map of 
F into F™. Show that if p' is a polynomial function on V’, then p'g is a polynomial 
function on V and the map 7(g) : p’ + p is an algebra homomorphism of A(V’) into A(V). 
Show that g ~ n(g) is a bijection of the set of regular maps of V into V’ and the set of 
algebra homomorphisms of A(V’) into A(V). 


7.13 PRIMARY DECOMPOSITIONS 


In this section we consider the classical Lasker-Noether decomposition theorems for ideals in 
noetherian rings as finite intersections of primary ideals. It is easy to see by using the 
ascending chain condition that any proper ideal J in a noetherian ring R can be written as an 
intersection of ideals that are indecomposable in the sense that /# 7; J, for any two ideals J; 


+ I. Moreover, in any noetherian ring, indecomposable ideals are primary in a sense that we 
shall define in a moment. This type of decomposition is a weak analogue of the decomposition 
of an element as a product of prime powers. Associated with every primary ideal is a uniquely 
determined prime ideal. However, primary ideals need not be prime powers and not every 
prime power is primary. Although the decomposition into primary ideals is not in general 
unique, it does have some important uniqueness properties. The establishment of these as well 
as the existence of the primary decomposition are the main results of the Lasker-Noether 
theory. We shall begin with the uniqueness questions, since these do not involve the noetherian 
property. The classical results can be generalized to modules and the passage to modules 
makes the arguments somewhat more transparent. In our discussion we shall consider the 
general case of modules first and then specialize these to obtain the results on ideals. 

IfM is a module for R and a € R, then as in section 3.1, we denote the map x ~ ax, x € M, 
by ay. Since R is commutative, this is an endomorphism of M as R-module. We have the 


homomorphism py : a ~ ay of R into the ring of endomorphisms of M and ker pj, is the set of 
a € R such that ax = 0 for all x. As usual we write annpx for the ideal of elements b € R such 
that bx = 0. Evidently ker py =O, e yy annex = annpM. If ax = 0 for some x +£ 0, then we shall 


call this element a of the ring R a zero divisor of the module M. The elements a that are not 
zero divisors are those for which aj, is injective. We now look at the nil radical of the ideals 


ker pj and annpx for a particular x. By definition, the first of these is the set of elements a € R 
for which there exists an integer m such that a” € ker py, that is, ax = 0 for every x or, 
equivalently, ay” = (a) = 0. Likewise a € nilrad (annpx) if and only if there exists an m 
such that ax = 0 or, equivalently, the restriction of aj, to the submodule Rx is nilpotent. 


The case of primary interest is that in which M = R/I where 7 is an ideal in R. Since we 
shall need to consider simultaneously such a module and the modules R//’ for /' an ideal of R 


containing J and since R/! = (R/V), we need to formulate our definitions and results in terms 
of a module and a submodule. The style is set in the following 


DEFINITION 7.4. A submodule O of an R-module M is called primary if OQ + M and for 
everya € Reithera m/o Ì5 injective or it is nilpotent. 


Evidently this means that if a is a zero divisor of M/Ọ, then there exists an integer m such 
that ax € QO for every x e M. The set P of elements satisfying this condition is the ideal nilrad 
(annp M/Q). This ideal is prime since if a É P and b € P, then amọ and bmo are injective. 
Then (ab) vig = amo Sug 1S injective and hence ab É P. We shall call P the prime ideal 
associated with the primary submodule Q. If M = R, the submodules are the ideals. Then the 
condition ax € Q for every x € R is equivalent to a” €e Q. Thus in this case an ideal Q is 


primary if and only if ab € Q, and b É QO implies that there exists an m such that a” € Q. The 
associated prime of Q is the nilradical of this ideal. 


EXAMPLES 


1. Let R be a p.i.d. and Q = (p°) = (p°), p a prime. It is readily seen that ifa € R and a É P 
= (p), then agg is invertible, hence injective. On the other hand, if a € P, then a°pig = 0. 
Hence Q is primary and P is the associated prime ideal. It is easy to see also that 0 and the 
ideals (p°) are the only primary ideals in R. 


2. Let R = Z and let M be a finite abelian group written additively and regarded as a Z- 
module in the usual way. Suppose every element of M has order a power of a prime p. Then 0 
is a p-primary submodule of M since if a € Z and (a, p) = 1, then a, is injective and if a is 
divisible by p, then ay is nilpotent. The associated prime ideal of 0 is (P). 


3. Let R = F[x, y, z]/(xy — z2) where F is a field and x, y , z are indeterminates. If a € F[x, 
y, Z], let a be its image in R under the canonical homomorphism. Let P = ( X , Z), the ideal in R 
generated by X , Z. Then the corresponding ideal in FT x, y, z] is (x, z), since this contains xy — 
z’. Since FTx, v, z|/(x, z) a F [y], it follows that P is prime in R. Since it is clear from Krull’s 
theorem that the nilradical of any power of a prime ideal is this prime, the ideal P? of R has P 
as its nilradical. However, this is not primary since X É P?, Y É P but w ==eP", 


We can obtain many examples of primary ideals by using the following 


PROPOSITION 7.24. Jf P is a maximal ideal in R, any ideal QO between P and P®, e = 1, is 
primary with P as associated prime ideal. 


Proof. Ifae P, thena® € P? CO. Ifa € pP, then we claim that a + Q is a unit in R/O. For, 
since P is maximal, R/P is a field. Hence we have an element a'€ R and a z e P such that aa’ 


= ] - z. Since z + Q is nilpotent in R/Q, (1 — z) + Q is a unit in R/O and hence a + Q is a unit 
in R/Q. It is now clear that if ab € Q, then b € Q. Thus we have shown that if a € P, then agg 
is nilpotent and if a É P, then apo İS injective. Then it is clear that Q is primary and P is the 
associated prime. O 

This result can be used to construct examples of primary ideals that are not prime powers. 
For instance, let P = (x, y) in Fx, y]. This is maximal, since F[x, y|/P = F. We have 
(xy) 2 O*,y) 2 ayy?) =P? Hence (x7, y) is a primary ideal that is not a prime power. 

It is clear that if O is a submodule of M and P is a subset of R such that 1) if a € P, then 
there exists an integer m such that ax € Q for all x e M and 2) ifa € R- P, then amọ is 
injective, then Q is primary with P as associated ideal. We use this to prove 


PROPOSITION 7.25. The intersection of a finite number of primary submodules of M 
having the same associated prime ideal P is primary with P as associated prime ideal. 


Proof. It suffices to prove this for two submodules Q; and Q,. Then Q} N Q, is a proper 
submodule since the Q; are proper. Let a e P. Then we have an integer m; such that a™"xeQ, for 
all xe M. If we take m = max(m, mz), then a”x € Q = Q; N Q. Next leta € R-P and let x € 
M — Q. We may assume that x É QO,. Then ax ¢ Q; so ax É Q. Thus amọ İS injective. Hence Q 
is primary with P as associated prime ideal. O 

We have noted that for any x the set ofa € R such that there exists an integer m such that a”x 
= 0 is the ideal nilrad(ann x). Hence if Q is a submodule and X =x + Qin M/Ọ, then the set of 
a for which there exists an m such that ax € Q is the ideal nilrad(annp X ). If Q is primary, we 
have the following important result. 


PROPOSITION 7.26. Let Q be a primary submodule of M, P the associated prime, and let 
x € M. Let I, = {alax € QO for some m}. Then I, = P ifx ¢ Oandl.=Rifxe Q. 


Proof. Letx É Q. Evidently I, D P, since ayjg is nilpotent for every a € P. On the other hand, 
if a € P, then amọ İS injective. This precludes ax € Q for any m. Hence 7, = P. The other 
assertion that ifx € Q, then /, = R is clear. DO 


Now let N be a finite intersection of primary submodules Q; of M: N=!) Q,. It may happen 
that some of these are redundant, that is, that N is an intersection of a proper subset of the Q;. In 
this case we can drop some of these until we achieve an irredundant decomposition N =| 10, 
which means that for every j, N #( ), +; Qi This is equivalent to assuming that for every j, 
Q; > (inj 


We shall prove two uniqueness theorems for decompositions of a submodule as intersection 
of submodules of M. Before taking this up in the general case, we illustrate the results in the 
special case of ideals of a ring in the following 


EXAMPLE 


Consider the ideal 7 = (x?, xy) in F[x, y]. Let Q = (x), Q, = (v — ax, x”) for a e F. Q is prime, 
hence primary with associated ideal P = Q. Since P) = (x, y) is maximal and 
P22 0.>Ps'= boo OQ is primary with associated prime P}. Evidently Z c Q N Q,. Now let 
h(x, y) E Q,, sO 


hix, y) = f(x. y)x? +g(x, y) (y= ax) 


for f(x, y), g(x, yje Fix, y]. If h(x, y) € (x), we have g(x, y) = xk(x, y) and then h(x, y) € T. 
Hence ON Q, cI, so I= Q N Q,. Since 2+2. and %+2, the decomposition is irredundant. 


Since Q, # Q, if a + b, we have distinct decompositions of / as irredundant intersections of 
primary ideals. It should be remarked that the associated primes P and P, are common to all of 


the decompositions and so is the primary ideal Q. This illustrates the two general uniqueness 
theorems, which we shall now prove. 


The first of these is 


THEOREM 7.18. Let N be a submodule of M and let N=( ı Q; where the QO, are primary 
submodules and the set is irredundant. Let {P,} be the set of prime ideals associated with 
the Q; Then a prime ideal P € {P;} if and only if there exists an x e M such that the set 
IN) of elements a € R for which there exists an m such that ax € N is the ideal P. Hence 
the set of prime ideals {P;‘ is independent of the particular irredundant primary 
decomposition of N. 


Proof. Since N=(¥,O, and s is finite, it is clear that for any x € M, L(N) = (V40). By 
Proposition 7.26, [,(Q,) = P; if x € Q; and (Q) = R if x € O,. Hence I,(N) = Pi, where the 
intersection is taken over those 7; such that x ¢ Q;;. Now suppose that /,(V) = P a prime ideal. 
Then P= NP, > TIP; so P> P for some i;. Since P, >P, we have P = P;, Next consider any 
one of the associated primes P;. By the irredundancy of the set of Q’s we can choose an 
x€Q,€(\j+i2; . Then the formula for J,(N) gives I,(N) = P;. This completes the proof. D 


We shall call the set of prime ideals {P;} the set of associated prime ideals of N =| FO; 


l 
These give important information on the submodule N. For example, we have 


THEOREM 7.19. Let N be a submodule of M that is an intersection of a finite number of 


primary submodules and let {P;} be the set of associated prime ideals of N. Then UP, is the 


set of zero divisors of M/N and | }P, is the nilradical of ker p yyy. 


Proof. We may assume that N =(\Q, as in the preceding theorem and P; is associated with 
Q;. (We may have P; = P, for i #7.) The foregoing proof shows that ifa € P;. we have an x € 
M — N such that ax € N for some positive integer m. If m is taken minimal, then a”~ lx É N 
and a(a”~ 'x) € N and so a is a zero divisor of M/N. Thus every element of UP, is a zero 
divisor of M/N. Conversely, let a be a zero divisor of M/N. Then we have an x € M, É N such 
that ax € N. Since x € N, x € Q; for some i and ax € Q,. Hence amo, is Not injective, so a € 
P.. Thus UP, is the set of zero divisors of M/N. This is the first statement of the theorem. To 
prove the second we recall that mlrad (ker pyy) is the set of a e R for which there exists an m 
such that ax € N for every x e M. This is the intersection of the set of elements a; € R for 
which there exists an m; such that 4f" X €Q; for every x € M. The latter set is P,. Hence nilrad 
(ker Puy) =(P; O 


It is clear also from the uniqueness theorem and Proposition 7.25 that a submodule that is an 
intersection of primary submodules of M is primary if and only if it has only one associated 
prime ideal. 

We now suppose that the irredundant decomposition N =! ',Q, into primary submodules is 
normal in the sense that distinct Q; have distinct associated prime ideals. This can be achieved 
by replacing any set of the Q; associated with the same prime by their intersection, which is 
primary, by Proposition 7.25. In the case of a normal decomposition we have a 1-1 
correspondence between the primary submodules of the decomposition and their associated 
prime ideals. We shall now call a primary submodule Q; in the normal decomposition N = | 
s Q; isolated if the associated prime ideal P, is minimal in the set of associated prime ideals of 


N. The second uniqueness theorem states that the isolated components persist in every normal 
decomposition of N as intersection of primary ideals: 


THEOREM 7.20. Let N=! 1Q; be a normal primary decomposition of N, P, an associated 
prime that is minimal in the set of associated primes. Then we can choose ana e | \ gl ¢ 
P, and if a is any such element, then the isolated component Q; corresponding to P; can be 


characterized as the set of x € M such that ax € N for some integer m. Hence the isolated 
component Q; is independent of the particular normal primary decomposition of N. 


Proof. If no a of the sort indicated exists, then P; 5! b4 iP; and hence P; > P; for some j # i, 


contrary to the minimality of P;. Now let °‘ Went Then a MIO; is nilpotent for j # i and awọ, 


is injective. Hence if x € M, there exists an integer m such that ax € Q; for j # i and hence 


axe \)+@), On the other hand, @#« is injective for every m, so ax € Q; holds if and only if x € 


Q,. Since N=(}°,Q,, we see that the stated characterization of Q; holds and its consequence is 
clear. O 

In the example given on p. 433 the associated primes are (x) and (x, y), so (x) is minimal. 
The corresponding isolated component is (x) and this persists in all of the decompositions of / 
= (x*, xy). We remark also that in a ring R in which all non-zero primes are maximal (e.g., a 
p.i.d.), the primary decomposition is unique if none of the associated primes is 0. 

We now assume that M is noetherian and we shall show that every submodule N of M can 
be expressed as a finite intersection of primary submodules. We shall call a submodule N of M 
(intersection) indecomposable if we cannot write N as N, N N, where N; # N for i = 1, 2. We 


first prove 


LEMMA 1. If M is noetherian, any submodule N # M can be written as a finite 
intersection of indecomposable ones. 


Proof. Ifthe result is false, the collection of submodules for which it is false has a maximal 
element N. Then N is not indecomposable, so we have submodules N, and N, such that N; # N 


and N; N M, = N. Then ^: FN and so, by the maximality of N, N; is a finite intersection of 
indecomposable submodules. Then so is N= N, N N}, contrary to assumption. O 


The result we wish to prove on primary decompositions will be an immediate consequence 
of this lemma and 


LEMMA 2. IfM is noetherian, then any indecomposable submodule N of M is primary. 
Proof. The proof is due to Noether and is similar to the proof of Fitting’s lemma (p. 113), 


which it antedated. If N is not primary, we have ana € R such that ayy is neither injective nor 
nilpotent. Consider the sequence of submodules of M/N 


0 c keray E kerdh €. 


-z mti |... . 1 
n = ker aMn = Since ayy 18 not 


Since M/N is noetherian, we have an m such that ker ‘dMi 
nilpotent, ker TMn # M/N Hence ker avin = N\/N where N, is a proper submodule of M 
containing N. Also since ayy is not injective, ker amin # 0, so N, # N. Now consider the 
submodule N, = aM + N and let x e N, N N>. Then x = ay + z where y e M, z € N, and ax 


ee 2m 
€ N. Hence a?”y € Nand the condition mn = Ket aMn gives ay e N. Thenx=a"y+ze N. 
Thus N; N NM, = N and N,#N, since N, # M. Hence N= N, N N, is decomposable, contrary to 


the hypothesis. O 


Evidently the two lemmas have the following consequence. 


THEOREM 7.21. Any submodule of a noetherian module M has a decomposition as finite 
intersection of primary submodules of M. 


Everything we have done applies to ideals in noetherian rings. In this case we obtain that if 
I is a proper ideal in a noetherian ring, then J =! \O, where the Q; are primary ideals. We may 
assume also that this representation of / is irredundant and the associated prime ideals P; of the 
Q; are distinct. The set {P;} is uniquely determined and these P, are called the associated 
primes of I. The Q; whose associated prime ideals P; are minimal in the set {P;} are uniquely 
determined, that is, they persist in every normal representation of J as intersection of primary 
ideals. 

Theorem 7.19 specializes to the following: If {P;} is the set of associated primes of Z, then 
UP, is the set of zero divisors modulo J, that is, the set of elements z € R for which there exists 
ay € R—Isuch that yz € J. Moreover, | \P; is the nil radical of J. Evidently! \P, = ( IP, where 
{P+ is the set of associated primes that are minimal in {P,}. 

If / is a proper ideal in a ring R, a prime ideal P is said to be a minimal prime over I if P > 
I and there exists no prime ideal P’ such that P = P’ > J. If R is noetherian, there are only a 
finite number of minimal primes over a given J: the minimal primes among the associated 
prime ideals of J. For, if P is any prime ideal containing Z, then P > nilrad J = || P;, Pi a 
minimal associated prime ideal of 7. Then P > IIP, and P > P, for some j. Hence P = P, if and 


only if P is a minimal prime over /. Taking J = 0 (and R # 0) we see that a noetherian ring R 
contains only a finite number of prime ideals P that are minimal primes in the sense that there 
exists no prime ideal P’ such that P = P'. These prime ideals are called the minimal prime 
ideals of R. Evidently R is a domain if and only if 0 is its only minimal prime ideal. 

We also have the following result on nil radicals in the noetherian case. 


THEOREM 7.22. Jf 1 is an ideal in a noetherian ring R and N is its nil radical, then there 
exists an integer m such that N” c I. In particular, if Ọ is a primary ideal in R and P is the 
associated prime, then P” c Q for some integer m. 


Proof. N has a finite set of generators z}, ..., z,, which means that N consists of the elements 
4,2; a; € R. We have an integer m; > 1 such that =" € I. Put m = >" m; — (r — 1). Then any 
product of m of the z; contains a factor of the form =" and hence is contained in /. It follows that 


any product of m elements of N is contained in /. Thus N” c IJ. If Q is primary, the associated 
prime P is nilrad Q. Hence the second statement follows from the first. O 


EXERCISES 


1. Let S be a submonoid of the multiplicative monoid of R, M an R-module, N a submodule 
that is an intersection of a finite number of primary submodules. Show that M, = N, if 


and only if PINS # Ø where {P;} is the set of associated prime ideals of N. Show that if 
Q is a primary submodule and Q, # M,, then Q, is a primary R,-submodule of M,. Show 
that if N, # M, (N as before), then N, has a decomposition as finite intersection of 
primary R,-submodules. 


2. Let N be a submodule of M and suppose that N = ‘@ is a normal primary decomposition 
of N. Let {P;} be the set of associated primes. Call a subset T of {P;} isolated if for any 
P, € T, every P; c P; is contained in T. Show that if T is isolated, then \® taken over the 
j such that P, € T is independent of the given normal primary decomposition of N. This 
submodule is called the isolated component of N corresponding to 7. 


3. If J, and /, are ideals in a ring R, define the quotient (J; : L) = {a € Rial, CI, }. Note 
that (Z; : L) > 4. Show that if R is noetherian, then (J, : L) = /, if and only if J, is not 
contained in any of the associated primes of 4. 


7.14 ARTIN-REES LEMMA. KRULL INTERSECTION THEOREM 


In the remainder of this chapter we shall derive some of the main theorems of the theory of 
commutative noetherian rings. Some of these results were originally proved by using the 
primary decomposition theorems of the previous sections. Subsequently simpler proofs were 
devised based on a lemma that was discovered independently and about the same time by E. 
Artin and D. Rees. We shall begin with this result. For the proof we shall make use of a certain 
subring of a polynomial ring that is defined by an ideal in the coefficient ring. 

Let R be a ring, R[x] the polynomial ring over R in an indeterminate x, / an ideal in R. We 
introduce the Rees ring T defined by / to be the R-subalgebra of R[x] generated by /x. Thus 


(32) T = R+Ixt x + e 1"x" $00, 
We now assume that R is commutative noetherian. Then / has a finite set of generators as R- 
module. If these are b,,b>, ..., b,, then Jx is generated as R-module by b,x,box, ..., b,x. It 


follows from (32) that T is generated as R-algebra by b,x, ..., b,x. Hence, by the Corollary to 
Hilbert’s basis theorem (p. 418), T is noetherian. 

Next let M be an R-module. Then we obtain the R[x]-module R[x] ® p M whose elements 
have the form =*®™, m; E€ M. It is convenient to regard this R[x]-module as a set of 
polynomials in x with coefficients in the module M. For this purpose we introduce the set M[x] 
of formal polynomials in x with coefficients in the module M. This set can be regarded as an 
R[x]-module in the obvious way. Using the universal property of RLX] @pM, we obtain a group 
homomorphism of R[x] ®@pM onto M[x] sending £x'@m ~ X™x", and using the fact that mx! = 0 
implies every m; = 0, we see that our homomorphism is an isomorphism. Moreover, it is clear 


that it is an R[x]-isomorphism. Hence we can identify R[x] ®pM with M[x], and we prefer to 


work with the latter model. It is clear that any set of generators of M as R-module is a set of 
generators for M[x] as R[x]-module. 

We now consider the subset TM of M[x]. This is the 7-submodule of [x] generated by M 
and it has the form 


(33) TM = M+(IM)x+(?7M)x? + + (IM) + + 


It is clear that any set of generators for M as R-module is a set of generators for TM as T- 
module. Hence if M is a finitely generated R-module, then TM is a finitely generated 7-module. 
Moreover, if R is noetherian, then T is noetherian and then TM is a noetherian 7-module. We 
shall use this fact in the proof of the 


ARTIN-REES LEMMA. Let R be a commutative noetherian ring, I an ideal in R, M a 
finitely generated R-module, M, and M, submodules of M. Then there exists a positive 


integer k such that for everyn > k 
(34) I"M, ^ M, = I""*(KM, ^ M3). 


Proof. We have !"~*U'M, ^ Mı) = I"M, ^ M3 for any k and n > k, so we have to show that there is 
a k such that !"M, ^ M, = I'*U*M, ^ M3) for every n > k. For this purpose we define the ring T as 
before and the 7-submodule TM of M[x]. We now define the subgroup 


N = (M, M,)+(IM, 9 Ma)x + (P? M, o Ma)? t 


of TM. Since J/M, N M; is an R-submodule of /M and since (1x) (2M, 9 M3)x! = (P*'M, A Ma)" N 
1 2 ; 


is a T-submodule of TM. Since TM is a noetherian T-module, N has a finite set of generators, 
Say, Up ..., U„ Where we can write “= 2-0" , where "u€!/M: M2, Now let n > k and let u € 


I"M, N M,, so ux” e N. Then we have f; € T so that ux” = X fiu; If we write f; = MX Kel’ we 


obtain ux” = Hane” Since j < k, comparing coefficients of x” gives an expression for u as a 
sum of terms of the form fv where v e I/M, N M, with] < k and fe I” ~J. Now 


I"-(EM, n M3) = I'M n M,) c I"-(EM, 0M). 
Hence u € J”~*(I* M, O M3). Thus MM: ^M, c MUM, ^ Ma), O 
Our first application of the Artin-Rees lemma is 


KRULL’S INTERSECTION THEOREM. Let R be a commutative noetherian ring, I an 
ideal in R, M a finitely generated R-module. Put !°M = (\e:!"M, Then IUM) = I°M, 


Proof. Put M, =M, M2=!"M, Then we have a k such that "M ^ I°M = PUM ^ I"M) for all n > k. 
Taking n = k +1 we obtain !*'Mol'M = I(I*M ^ I"M) and since MM ^1"M = 1°M and Ä*'M n1I"M =1°M, 


we have the required relation !(/"M) = "M, O 
In section 3.14 (p. 174) we derived a criterion for the equality JM = M for J an ideal in a 
commutative ring R and M a finitely generated R-module: IM = M if and only if / + annpM = R. 


In element form the latter condition is equivalent to: there exists b e J such that (1 + b)M = 0. 
For, if b is such an element, then 1 =— b + (1+ b) and- be J/,1+b6€ annpM. Hence 1 e J+ 


annpM and since J + annpM is an ideal, Z + annpM = R. The converse is clear. 


If we combine this result with the Krull intersection theorem, we see that if Jis an ideal ina 
noetherian ring and M is a finitely generated module, then there exists a b € J such that (1 + 


b)I?M = 0. If I = rad R, the Jacobson radical of R, then every 1 + b, b e J, is invertible so we 
can conclude that (rad R)°M = 0. This follows also from Nakayama’s lemma. At any rate we 
have 


THEOREM 7.23. Let M be a finitely generated module for a noetherian ring R and let J = 
rad R. Then (\m=1J"M = 0, 


We obtain an interesting specialization of the foregoing results by taking M = R (which is 
generated by 1). Submodules of R are ideals. Then the ArtinRees lemma states that if Z, /;, and 


I, are ideals in R, then there exists a k such that for all n > k 
(35) PL 01, = P(A, oL). 


If we take /, = R and change the notation slightly, we obtain the result that if 7, and /, are ideals 
in a noetherian ring R, then there exists a k such that for n > k 


(36) L'al, = PLEA). 


This was the original form of the lemma given by Rees. If we take M = R in Theorem 7.21, we 
obtain 


(37) A= 0 


for J, the Jacobson radical of a commutative noetherian ring. 
As we shall see in section 7.18, the results of this section play an important role in the study 
of completions of rings. 


7.15 HILBERT’ S POLYNOMIAL FOR A GRADED MODULE 


In the same paper in which he proved the basis theorem and the theorem on syzygies, Hilbert 
proved another remarkable result, namely, a theorem on the dimensionality of the homogeneous 


components of a graded module for the polynomial ring R = F[x), ..., X„] with coefficients in a 
field F (see the reference on p. 387). He showed that if M = ®M,, is graded where M, is the 
homogeneous component of degree n, then there exists a polynomial f(t) of degree < m — 1 with 
rational coefficients such that for all sufficiently large n, An) = [M,, : F]. We shall prove this 
theorem in a generalized form in this section and apply it in succeeding sections to derive the 
main results on dimensionality in noetherian rings. 

We recall that a ring R is graded by an indexed set of subgroups R; i = 0, 1,2, ..., of its 
additive group if R = ®R, and RjR; C Rj; 
€ Ro. Hence Rọ is a subring of R. It is clear that R * = 2; >o 8; is an ideal in R. If R is 
commutative, then R can be regarded as an algebra over Rọ. We recall also the definition of a 
graded R-module M for the graded ring R (p. 384). Here we have M = ®; > ọM; where the M; 
are subgroups and R; M, C M; +; for all i, j. The elements of M, are called homogeneous of 
degree i. We have the following 


for all i, j. Then Rọ? c Ry and it is easily seen that 1 


PROPOSITION 7.27. Jf R is a graded commutative ring, then R is noetherian if and only 
if Ro is noetherian and R is finitely generated as Ro-algebra. If the conditions hold and M is 


a finitely generated graded R-module, then every M, is a finitely generated Ro-module. 


Proof. If Ro is noetherian and R is a finitely generated Rp-algebra, then R is noetherian by the 
Corollary to Hilbert’s basis theorem (p. 418). Now assume that R is noetherian. Then Rọ = R/ 
R* is a homomorphic image of the noetherian ring R, hence is noetherian. The ideal R” of R is 
finitely generated as R-module by, say, x), ..., Xm. By replacing each x; by its homogeneous 
parts, we may assume that the x; are homogeneous elements of R*. We claim the x; are 
generators of R as Ro-algebra. It suffices to show that every homogeneous element can be 
written as a polynomial in the x; with coefficients in Rọ. We use induction on the degree of 
homogeneity. The result is clear if the degree is 0. Now suppose that u is homogeneous of 
degree n > 0. We can write u = >)"; u; x; where u; € R. Equating homogeneous parts, we may 
assume that u; is homogeneous of degree n — deg x; < n, since x; € R*. Then u; is a polynomial 
in the x; with coefficients in Rp and hence the same is true of u =}; uxx; 


Now suppose that R is noetherian and M is a finitely generated graded R-module. We may 
choose a set of generators {u), ..., u,.} for M that are homogeneous. If the x’s are chosen as 


before, then it is readily seen that every element of M, is an Rp-linear combination of elements 
yju; where y; is a monomial in the x’s and deg y; + deg u; = n. Since the number of these 
elements is finite, M, is finitely generated as Rp-module. 0 


We now assume that (1) R is noetherian, (2) Rọ is artinian (as well as noetherian), and (3) 
M is a finitely generated graded R-module. Then every M, is finitely generated as Ro-module 
and hence is artinian and noetherian. Hence M, has a composition series as Ro-module and we 


have a uniquely determined non-negative integer /(Z,,), the length of the composition series for 
M,,. To obtain information on these lengths we introduce the generating function or Poincare 
series of the graded module M as the element 


(38) P(M,t) = (M,)e 
0 


of the field @((¢)) of formal Laurent series (p. 425). We can now state the following 


THEOREM 7.24 (Hilbert-Serre). Jf R is generated by homogeneous elements x, ..., X 
where deg x; = 0, then P(M, t) is a rational expression of the form 


(39) fO / [a-e 


where f(t) € 2[¢]. 


m 


For the proof we require the 


LEMMA. Let 0 —> M)— M; —... — M, — 0 be an exact sequence of finitely generated 
Ro-modules for an artinian ring Ro and let I(M,) be the length of M,. Then Ssl- 4M) = 9, 


Proof. The result is immediate from the definition of exactness if s = 0 or 1. If s = 2, we 
have the short exact sequence 0 — Mo —> M, — M, — 0. Then M, = M,/M,) and (M3) = I(M;) 


— (Mọ), which gives /(M,)—/(M,) + (M3) = 0. Now suppose that s > 2. Then we can imbed the 
given sequence in a sequence of homomorphisms 


0> M, >imM> M,>imM,>:::>imM,_,>M,-0 


where M; — im M, is obtained by restricting the codomain of M; —> M; + ı and where im M; > 


M; 1 is an injection. Then we have the exact sequences 0 — Mo —> im Mọ > 0, 0 —> im M; _ | 


— M, > imM, > 0,1<k<s—1,0>imM,_,;— M, — 0, which give the relations 
(My) = (im Mo) 
I(M,) = (im M,-,)+/(imM,), 1<k<s-l 
I(M,) = I(im M,_,). 


If we take the alternating sum of these equations, we obtain the required relation Le(—!)(M,) = 0 
m 


We can now give the 


Proof of Theorem 7.24. We use induction on the number m of homogeneous generators. If m = 
0, R= Rọ and M is a finitely generated Rp-module. Then M, = 0 for sufficiently large n and the 


result holds with f(t) = P(M, t). 

Now assume the result holds if R has m — 1 homogeneous generators. We act with x,, on M. 
This is an R-endomorphism x*,,, sending M, — Mn + em n = 0, 1,2,.... Hence ker x*,, and im 
xX* „n are homogeneous submodules, so we have the graded modules K = ker x*,, = ®K,, C= 
coker x* „= ®C,,. We have exact sequences 


0> K,7 M, > Mite, Caten >l. 


Hence, by the lemma, 
(M,)— (Mite) = I(K,) jë Cartea) 
If we multiply this relation by ¢” * &» and sumon n =0, 1, 2, ..., we obtain 


(40) (t°"—1)P(M,t) = P(K,t)t°"— P(C,t)+g(t) 


where g(t) is a polynomial in t. Now x,,K = 0 and x,,C = 0. Hence C and K are effectively 
R/Rx,, modules. Since R/Rx,, is a graded ring with m — 1 homogeneous generators of degrees 
ej .--» €m — 1 and since C and K are graded modules for this ring, we have 
Pk. = fT 1). PIC.) = ANTI A-1), Substitution of these in (40) gives the theorem. O 


The most important case of the foregoing result is that in which the generators can be 
chosen in R}, that is, the e; = 1. This is the case if R = Ro[x1, ..., Xm] where Rọ is artiman, the x; 
are indeterminates, and the grading is the usual one in which R, is the set of homogeneous 
polynomials of degree n. In fact, this case can be regarded as the most general one, since any 
graded ring with the e; = 1 is a homomorphic image of Ro[xo, ..., Xm] under a graded 
homomorphism and any module for the image ring can be regarded as a module for Ro[x), ..., 
Xm]. For the applications it is convenient, however, to assume that R is any ring satisfying our 
earlier conditions and the condition that R is generated by elements of Rj. In this case, 
Theorem 7.24 states that P(M, t) =f(t)/(1 — A”. We can use this to prove the main result on the 
lengths /(M,): 


THEOREM 7.25 (Hilbert-Serre). Suppose that R is generated by m homogeneous 
elements of degree 1. Then there exists a unique polynomial W) of degree <m — 1 with 


rational coefficients such that I(M,,) = '(n) for sufficiently large n. 


Proof. We have P(M, t) = (Mo = AN0- ang Ad = (a+) where deg r(t) < m. Then 


r(t) = 4 +4,(1=1)+°* +ay-,1-9""", a, E Q, and hence 


f(t) E | l 
ane Oto tO Gp 


It is readily seen, using the differentiation formula ((!=")""Y = kd =9""*"" that 


l | 
(aa Genin By TED ote 


(41) $01 


1-t 


The coefficient of ¢” on the right-hand side is a polynomial in n of degree k — 1 with rational 
coefficients. Hence the coefficient of t” in %0 -0 +a- "+= tas-1- is (n) where F is a 
polynomial of degree <m — 1 with rational coefficients. By (37), T(n) = I(n) if n > deg q(t). The 
uniqueness of T is clear, since a polynomial € Q(t) of degree < m — 1 that vanishes for m 
consecutive integers must be 0. O 

The polynomial T = (M) is called Hilberfs (characteristic) polynomial for the graded 
module M. Theorem 7.25 shows that T has integral values at the nonnegative integers. This 
does not imply that T € 2[t]. However, it is easy to determine the polynomials having this 
property. For any n = 1, 2, 3, ... we define the polynomial 


(49) x _X(x— De ent )) 


n n! 


and we put ®) = 1. Evidently the polynomials ©) € Q[x] and since deg ®© = n, the © form a base 
for Q[x]/ @. As with binomial coefficients, we have 


(43) A x\_ x+! _([X\_f x 


n n n n-1/ 


If x is an integer, then ™ is a product of n consecutive integers divided by n! It is readily seen 
that this is an integer. Hence any integral linear combination of the polynomials 1, {),..., © has 
integral values for integral values of the argument. Conversely, we have 


PROPOSITION 7.28. Let f(x) be a polynomial of degree n with rational coefficients such 
that f(x) takes on integral values for some set of n consecutive integers. Then f(x) is an 
integral linear combination of the polynomials “0 <j <n. 

Then 
Af (x) = f(x+ I)—f(x) = a: + a2(7) +" +ae(n* 1) Using induction on the degree we conclude that every a; 


Proof. We can write f(x) = atali+ +a) with rational a; 


with i > 0 is an integer. It follows also that aj € Z. D 


EXERCISES 


1. Determine the lengths /(R,,), n = 0, 1, 2, ..., for Rp, the set of homogeneous polynomials 
of degree n in R= Fx), ..., Xml, F a field. Determine P(R, t) as rational expression in t. 


2. Let f be an integral valued function on N = {0, 1, 2, ..., } such that An + 1) — f(n) = Fn) 
where I(t) € Q[f] has degree m — 1. Show that there exists a polynomial J (t) € aft] of 
degree m such that f(n) = J (n), n e N. 


7.16 THE CHARACTERISTIC POLYNOMIAL OF A NOETHERIAN 
LOCAL RING 


We consider first a general construction of a graded ring G(R) and a graded module G({M) 
from an ideal J and module M for the ring R. We put GAR)= err", G(M) = ®I"M/I"*'M, n >0, 
where 7? = R. If a, €e IŻ + ! and SEPM/P™'M, we have &= atl", = xt" M, EM, Then 
ax+l""""M is independent of the choice of the representatives a; x; Hence if we put 
GX; = Xp (La,)(LX;) = Lax; = Lax, these are well defined. It is readily verified that if we take M = 
R, we obtain the structure of a ring on G(R) with unit 1 + J. Also G{M) is a graded module for 
GR) with the 7 ”M/I ” * 1M as the homogeneous components. We shall call G(R) and GM) 
the graded ring and graded module associated with the ideal I. 

There is an alternative way of defining G(R) and G({M). We consider the rings T and 
modules 7M we introduced in Section 7.14 for the proof of the Artin-Rees lemma. We defined 
T = Xasol"*" = R[x], x an indeterminate, and TM = X.20(l"M)x" € M[x] = R[x]@eM_, It is clear that T is 
a graded ring with nth homogeneous component T, = J"x” and TM is a graded module for T 
with nth homogeneous component (/”M)x”". We have a homomorphism of graded R-algebras of 
T onto G(R) mapping bx*, b e Ik, k > 1, onto b + J% *!. The kernel is IT. Hence we can 
identify G(R) with T/T. Similarly, we can identify G(M) with TM/ITM, where ITM = Y(I" + 
IM)x". 

We showed in Section 7.14 that if R is a noetherian ring, then 7 is noetherian and, 


moreover, if M is a finitely generated R-module, then 7M is a finitely generated 7-module. 
Using the foregoing isomorphisms, we obtain the following 


PROPOSITION 7.29. Jf R is noetherian, then G(R) is noetherian for any ideal I and if R 
is noetherian and M is a finitely generated R-module, then G{M) is a finitely generated 
G,(R)-module. 


In the situation in which R// is artinian we could apply the results of the last section to 
define a Hilbert polynomial for the graded module G,(M). For, it is clear that G(R) is 


generated by its homogeneous component 7/1 of degree one. We shall not pursue this further 
but instead we specialize to the case of primary interest in which R is local noetherian with 
maximal ideal J and 7 = Q is a j-primary ideal ( = primary ideal with J as associated prime). 
We recall that since J is maximal, any ideal Q such that J? c Q c J for some e > 1 is J-primary 
(Proposition 7.24, p. 435). On the other hand, if Q is J-primary, then Q c J and Q > J for 
some e (Theorem 7.22, p. 440). Hence the condition on Q is equivalent to J? c Q c J for some 
e. The condition we require that R/Q is artinian is certainly satisfied since R/Q is a local ring 
whose maximal ideal J/Q is nilpotent (Theorem 7.15). 

We can apply the results on the Hilbert polynomial to the ring G(R) and module Go(R). 
Suppose that y}, ..., Ym are elements of Q such that the cosets y; + Q? generate Q/Q? as R- 
module. Then it is readily seen that the cosets y; + Q? generate Q/Q? + 07/Q? + ... as Go(R)- 
module. Hence, by the proof of Proposition 7.27, these elements generate Go(R) as algebra 
over R/Q. It follows from the theorem of Hilbert-Serre that there exists a polynomial I(t) € @ 
[t] of degree < m — 1 such that T(n) = 1(0"/O" + ') for sufficiently large n where 1(Q”/O" 7 ') is 
the length of 0”/O" * | as R/O-module. 

We now switch to R/Q” as R-module. We have the chain of submodules RD OD Q >... D 
QO” with factors R/O, O/O’, ..., O” ~ '/O”, which can be regarded as R/O-modules. As such 
they have composition series and lengths. Hence this is the case also for R/Q” that has a length 
1(R/Q"). Since !(R/Q"*')=(R/Q") = (Q"/Q""') = lin) for sufficiently large n, it follows that there exists 
a polynomial yo" € Off] of degree <m such that y9"(n) = 1(R/Q") for sufficiently large n 
(exercise 2, p. 448). We can state this result without reference to the graded ring Go(R) as 
follows: 


THEOREM 7.26. Let R be a noetherian local ring with maximal ideal J and let Ọ be a J- 
primary ideal. Suppose O/O* is generated by m elements as R-module. Then for any n= 1, 2, 
3, ..., R/O” has a composition series as R-module and if I(R/Q") is its length, then there 
exists a polynomial xo" € Q[¢] of degree <m such that I(R/Q") = Xo" (n) for sufficiently large 
n. 


The polynomial Xo" is called the characteristic polynomial of R relative to Q. The most 


important thing about this polynomial is its degree. For, this is independent of the choice of Q 
and hence is an invariant of R. To see this, let O' be another J-primary ideal. Then there exist 


positive integers s and s’ such that O° c Q and Q5 c Q'. Then R > O" > Q” and (R/Q") < 
I(R/Q' 5"). Hence xe") < 284s") for sufficiently large n. This implies that deg *?" $ 4°84% and by 
symmetry deg xg” = deg yg". We shall denote the common degree of the 79% by d = d(R). 


7.17 KRULL DIMENSION 


We shall now introduce a concept of dimension for noetherian rings that measures the size of 
such a ring by the length of chains of distinct prime ideals in the ring. For our purposes it is 
convenient to denote a finite chain of distinct prime ideals by Po ? P: #'° # and call s the 
length of the chain. With this convention we have the following 


DEFINITION 7.4. Jf R is a noetherian ring, the Krull dimension of R, denoted as dim R, is 
the supremum of the lengths of chains of distinct prime ideals in R. If no bound exists for 
such claims, then dim R = œ 


We remark that the Krull dimension of a noetherian ring may be infinite. Examples of such 
rings have been given by M. Nagata. We observe also that in considering a chain P = 
Po ? P, @ ®P, we can pass to the localization R with maximal ideal P. The correspondence 
between prime ideals of Rp and prime ideals of R contained in P gives a chain of prime ideals 


Pr? Pir? = Pw in R, We shall therefore consider first dimension theory for noetherian local 
rings. If R is such a ring with maximal ideal J, then it is clear that it suffices to consider chains 
that begin with Py) = J. The fundamental theorem on dimension in noetherian local rings is the 


following 


THEOREM 7.27. Let R be a noetherian local ring. Then the following three integers 
associated with R are equal: 
(1) The degree d = d(R) of the characteristic polynomial xo" of any J-primary ideal Q 
of R. 
(2) The minimum number m = m(R) of elements of R that generate a J-primary ideal Q 
of R. 
(3) dim R. 


(The first two numbers are evidently finite. A priori the third may not be. The assertion of 
the theorem is that it is, and it is the same as d(R) and m(R). Thus the theorem implies that the 
dimension of any noetherian local ring is finite.) 


Proof. The theorem will be proved by showing that d < m < dim R < d. 


d< m. This is a part of Theorem 7.26. 

m < dim R. There is nothing to prove if dim R = œ, so we assume that dim R < œ. We use 
induction on dim R. Now dim R = 0 if and only if J is the only prime ideal in R. In this case R 
is artinian by Theorem 7.14. Then J? = 0 for some e and 0 is J-primary. Hence m = 0. Thus the 
inequality m < dim R holds if dim R = 0. Now suppose that 0 < dim R < œ. Let P}, ..., P, be 


the minimal primes of R (see p. 433). Since any prime ideal in R contains one of the P; and 
every prime ideal is contained in J, J # P, 1 < i <r. Hence, by the prime avoidance lemma (p. 
390) there exists an element x € J, UP.. Consider the ring R’ = R/Rx. This is a local ring with 
maximal ideal J/Rx. Any chain of distinct prime ideals in R’ has the form P‘o/Rx P? PW Rx 
where the P’, are prime ideals in R containing Rx. Now P', contains one of the minimal primes 


P, and P’, £ P; since x É P,. Hence we have a chain of prime ideals Po? +? Ps? Pi for some i. 
This shows that dim R' < dim R — 1. On the other hand, if y; + R,, ... Ym + R is a set of 
generators for a J-primary ideal QR, in R’, then y}, ..., Ym x is a set of generators for the J- 
primary ideal Q'in R. Hence m(R) < m(R’) + 1. The induction hypothesis gives m(R’) < dim R’. 
Hence m(R) < m(R') + 1 < dim R'+ 1<dimR. 

dim R < d. We shall prove this by induction on d. If d = 0, then /(R/J”) is constant for 
sufficiently large n. Hence J” = J” *! for sufficiently large n and J” = 0 by Nakayama’s lemma. 
Hence R is artinian and J is the only prime ideal in R. (Theorems 7.14 and 7.15, pp. 426—427.) 
Then dim R = 0 and dim R = 0 =d. 

Now assume d > 0. We have to show that if Pp =/ ? Pi P|? P, is a chain of prime ideals in 
R, then s < d. We reduce the proof to the case of a domain by forming the domain R/P,, which 
is local with maximal ideal J/P, and has the chain of prime ideals //P: 2 Pi/P, 2°" ẹ PVP, = 0. We 
have (J/P,} = (J"+ PVP, so (R/P JP," = (RIPA + PAP.) & RAS" + Py), Hence 
NUR/PA/U/PJ) = KRU" + PD) SURI. Tt follows that d(R/P,) < d(R). Hence it suffices to prove s < d 


in the case in which R is a domain and we have the chain of prime ideals J = Po P> ? Ps 29, 
Letx € P,_1,x #0, and consider R/Rx, which is local with maximal ideal J/Rx and has the 
chain of prime ideals //R* 2? ? ?s-1/R*, We wish to show that d(R/Rx) < d(R) — 1. As above, 
1((R/Rx)/(J/Rx)") = (R(J" + Rx)). Now (R(J" + Rx)) = (RIP — IJ" + Rx)/J") and 
IU(J" + RxyJ") = (Rx/J" A Rx), By the Artin-Rees lemma there exists a positive integer k such that 
J*a Rx =J*™*U* A Rx) for all n > k. Hence 


((J"+Rx)/J")) > U(Rx/J"~ *x). 


Since x # 0 in the domain R, multiplication by x is an R-module isomorphism of R onto Rx and 
this maps J” 74 onto J” ~ *x. Hence R/J” 74 = Rx/J"~ *x as R-modules and so [(Rx/J" 7 x) = 
(RI J" ~ £). Hence 


I((R/Rx)/(J/Rx)" = I(R/(J" + Rx)) 
= [(R/J")—I(Rx/J" ^ Rx) 
< I(R/J")—I(R/J"~*). 
This implies that 
d(R/Rx) < d(R)-1. 
Using the induction on d we conclude that s — 1 < d(R) — 1 and so s < d(R). D 
We shall now apply Theorem 7.27 to arbitrary noetherian rings. If R is such a ring, we 


define the height of a prime ideal P in R as the supremum of the lengths of chain of prime 
ideals Po= PF P? F? Ps, It is clear that the height, ht P, is the dimensionality of the noetherian 


local ring Rp. This is finite by Theorem 7.27 Evidently this has the following rather surprising 
consequence. 


THEOREM 7.28. Any noetherian ring satisfies the descending chain condition for prime 
ideals. 


Another quick consequence of the main theorem is the 


GENERALIZED PRINCIPAL IDEAL THEOREM (Krull). Jf I is a proper ideal in a 
noetherian ring and I is generated by m elements, then any minimal prime over I has height 
<m. 


Proof. Let P be a minimal prime over Z. The assertion is equivalent to ht P, < m in Rp. Now 
P, is a minimal prime over Jp in Rp and since P, is the only maximal ideal in Rp, P, is the only 
prime ideal of Rp containing Ip. Hence Ip is Pp-primary. On the other hand, if yp ..., Ym are 
generators of /, then y,/1, ..., y,,/1 generate Ip. Hence ht P, = dim Rp < m. O 


A special case of the foregoing theorem is the 


PRINCIPAL IDEAL THEOREM Let y be a non-unit in a noetherian ring R. Then any 
minimal prime ideal over R, has height < 1. 


It is clear from the definitions that the Krull dimension of a noetherian ring is the supremum 
of the heights of its maximal ideals or, equivalently, the supremum of the Krull dimensions of 
its localizations at maximal ideals. 


We now consider the Krull dimension of a polynomial ring Fx), ..., Xm] in m 
indeterminates over an algebraically closed field F. We recall that any maximal ideal M in 
F(X, -.-, Xm] is generated by m elements x, — 44, X2 — Ap, ..., Xj — Ay» ai € F (p. 426). Hence ht 
M < m. On the other hand, we have the chain of prime ideals 

m-1 


(44) Y R(x—a;) = >, R(x —a;) P 2 R(xX,—a,) 2 0. 
1 


It follows that ht M = m. Since this holds for any maximal ideal, it follows that dim F[x, ..., 
Xm] = mM. 
We shall obtain next an important application of the generalized principal ideal theorem to 


systems of polynomial equations with coefficients in an algebraically closed field. This is the 
following 


THEOREM 7.29. Let F be an algebraically closed field and let fi, ..., Xm) <- JX] «++ 
Xm) E R= F[x, ..-,X,| where the x; are indeterminates. Assume that the system of equations 


(45) iia oy) = G org hes aXe) =O 


has a solution in F(m) and m > n. Then (45) has an infinite number of solutions in F(m) 


Proof. Assume to the contrary that the system has only a finite number of solutions: 
(Pah, Let J = 5", Rf, so the variety VD) = S = ah... Let 
l= $F- Rua), 1 Sj <r Then V(I) consists of the single point (a,, ..., On”) and S = 
Ui vu) = v] i) (p. 430). Hence if r = Iiu, then VU) = S = VU) and so, by the Hilbert 
Nullstellensatz, nilrad / = nilrad /’. Let P be a minimal prime ideal over Z. Then P > nilrad J = 
nilrad Hence P > /'=I1J;, so P > 1; for some j and since J; is maximal, P = J;. We have seen 


that the height of any maximal ideal in R is m. Hence the height of any minimal prime over J = 
>" Rf, is m> n, contrary to the generalized principal ideal theorem. O 


A useful special case of this theorem is obtained by taking the f; to be polynomials with 0 


constant terms (e.g., homogeneous polynomials of positive degree). These have the trivial 
solution (0, ... ,0). The theorem shows that if there are more unknowns than equations, then the 
system (45) has a nontrivial solution. 


If R is a noetherian local ring with maximal ideal J and dim R = m, then any set of m 
elements such that y4, ..., y,, Such that X”; Ry; J-primary is called a system of parameters for 


R. Of particular interest for geometry are the local rings R for which J itself can be generated 
by m = dim R elements. Such local rings are called regular. The following exercis € S 
indicate some of their properties. 


EXERCISES 


— 


. Show that if R is a noetherian local ring with maximal ideal J, then [J?/J : R/J] > dim R 
and equality holds if and only if R is regular. 


2. Show that R is regular of Krull dimension 0 if and only if R is a field. 


3. Let R be a regular local ring with J 0. Show that / 2/’. Show that ifx € j —J*, then R' = 
R/Rx is regular with dim R' = dim R — 1. 

4. Let R, be a principal prime ideal in a commutative ring R. Show that if P is a prime ideal 
such that R,  P, then *< N=: (RP Use this to prove that if R is a local ring that is not a 
domain, then any principal prime ideal in R is minimal. 

5. Prove that any regular local ring is a domain. (Hint: Use induction on dim R. The case 
dim R = 0 is settled by exercise 2. Hence assume that dim R > 0. By exercise 3 and the 
induction hypothesis, R/Rx is a domain for any x € J—J*. Hence Rx is a prime ideal for 
any x € J — J. Suppose that R is not a domain. Then by exercise 4, Rx is a minimal 


prime ideal for any x € J—J*. Let P}, ..., P, be the minimal prime ideals of R. Then J — 
J? =UiP: and 734 Ui", By the prime avoidance lemma, J c p,. for some i. Then J = P; is 
a minimal prime ideal in R. Hence dim R = 0, contrary to hypothesis.) 


6. Let M be a module for a commutative ring R. A sequence of elements 44,47, ..., a,, of R is 
called an R-sequence on M if (i) (284M #M and (ii) For any i, 1 <i < n, a; is not a zero 
divisor of the module M />'~ ! Ra;. If M = R, we call the sequence simply an R- 
sequence. 

Show that if R is a regular local ring with dim R = m and J = rad R, then J can be 
generated by an R-sequence of m elements contained in J — J”. 


7. Let R be a noetherian local ring with maximal ideal J. Show that if J can be generated by 
an R-sequence, then R is regular. 


7.18 J-ADIC TOPOLOGIES AND COMPLETIONS 


Let J be an ideal in a ring R that for the present need not be commutative, M an R-module. We 
can introduce a topology in M, the /-adic topology, by specifying that the set of cosets x + PM, 
xe M,x € M,n =1, 2, 3, ..., is a base for the open sets in M. If ze(¢+!"M)ov+!l") and q = 
max(m, n), then =+/*M = (x+I"M)^ (y +I"M), Since, by definition, the open sets are the unions of the 
cosets x + I”M, it follows that the intersection of any two open sets is open. The other axioms 
for open sets—namely, that the union of any set of open sets is open and that M and Ø are open 
—are clearly satisfied. Hence we have a topological space. In particular, if we take M = R, we 
obtain the /-adic topology in R in which a base for the open sets is the set of cosets x + 7”, x € 
R. 

We recall that a group G is called a topological group if it has a topology and the map (x,y) 
~ xy | of G x G into G is continuous. A ring R is a topological ring if it has a topology such 
that the additive group is a topological group and multiplication is continuous. A module M for 
a topological ring R is a topological module if it is a topological group and the map (a, x) + 
ax of R x M into M is continuous. 

It is readily verified that R endowed with its /-adic topology is a topological ring and if M 
is a module endowed with the /-adic topology, then M is a topological module for R. 

The J-adic topology in M is Hausdorff if and only if !™={\!"™ =0, For, if this condition 
holds, then given distinct elements x, y in M there exists an n such that x — y É I'M, which 
implies that the open neighborhoods x + /"M and y + I’M of x and y respectively are disjoint. 
Conversely, if/\!"” #0, letz #0 be in N J"M. Then any open set containing 0 contains z. Hence 
z is in the closure of {0} and M is not a 7|-space and hence is not Hausdorff. 

From now on we shall assume that 1” = 0 and 1 M = 0, so the J-adic topologies in R and M 
will be Hausdorff. We recall that these conditions are satisfied if R is a commutative 


noetherian ring, /= rad R, and M is a finitely generated R-module (Theorem 7.23, p. 443). We 
recall also that if R is a commutative noetherian ring and / is any ideal in R, then (1 + by’ = 0 
for some b e J. It follows that if J is a proper ideal in a noetherian domain, then 1 = 0. An 
important special case is obtained by taking R = Fx, ..., x,,] where F is a field and J = )’Rx,, 
the ideal of polynomials vanishing at (0, ..., 0). 

As in the case of rings of power series, we can introduce the concepts of convergence, 
Cauchy sequences, etc. It is convenient to base the definitions on the notion of the order of an 
element with respect to the /-adic topology. Let x be a non-zero element of M. Since M > IM > 
PM >... > and \/"M = 0, there exists a largest integer n such that x € I”M. We define this 
integer to be the order o(x), and we complete the definition of the order function by putting 
o(0) = œ. We use these definitions also in R. Our definitions evidently imply that o(x) = œ if 
and only if x = 0, o(—x) = o(x), o(x + y) = min(o(x), o(y)) and if a € R, then o(ax) = o(a) + 
o(x). Hence, if we put |x| = 2 ~ ° and | a | = 27 °™ (with the convention that 27 ® = 0), we 
obtain 


(1) IxieO,  ixi=O<x= 0, 
(i1) |— x] = Ixl, 
(iii) |x+y| < max(|x|,|yl), 
(iv) |ax| < |a||x]. 


These hold also for R and | | define metrics in M and R by d(x, y) = x — y|. Then we have the 
usual properties: d(x, y) = d(y, x), d(x, y) = 0, and d(x, y) = 0 if and only if x = y. In place of 
the triangle inequality d(x, y) < d(x, z) + d(z, y) we have the stronger inequality 


(46) d(x, y) < max(d(x,z),d(z, y)). 


This has the curious consequence that all triangles are isosceles, that is, for any three elements 
x, yY, Z, two of the distances d(x, y), d(x, z), d(y, z) are equal. For,(46)and the symmetry of d(,) 
implies that no one of the numbers d(x, y), d(x, z), d(y, z) can exceed the other two, hence some 
two are equal. 

The metric d(x, y) = x — y| can be used to introduce convergence of sequences, Cauchy 
sequences, etc. in the usual manner. We say that the sequence of elements {x,} converges to x 


and we write limx, =x or x, — x if for any real € > 0 there exists an integer N = Me) such that 
x — x,|<e for all n > N. As usual, we have that x, — x andy, —> yimplyx, t y, > x+y and 
x, > x anda, — a imply a,x, — ax. 

The sequence {x,} is called a Cauchy sequence if for any e > 0 there exists an integer N 
such that x, — x,„| < e for all m,n > N. Since for m >n 


xx! = [tee Mea oe Xm -2) + _ + adi —X,)| 


S max (jX = Xm - 1 , a l — Xm -2l ANS IXn+ 1 =h 


{x,} is a Cauchy sequence if and only if for any e > 0 there exists an N such that |x, + 1 — x,| < E€ 
for all n > N. We call M complete in the J-adic topology if every Cauchy sequence of elements 
of M converges. 

We also have the usual definitions of convergence of series: EFX, = *%:+*:+° converges if 
the sequence of partial sums s = x, + x7 +... +x, converges. If s, — s, then we write s = x, + 


X2 + .... We note that if M is complete, then a series x;tx, + ... converges if and only if its nth 
term x, converges to 0. For, if this condition holds, then the sequence of partial sums is Cauchy 
by the remark we made before. Hence x,+x,+ ... converges. Conversely if x, + x, + ... 
converges, then s, — sands, _ | > S, SOx, =S,—S,_,— 0. 

This simple criterion for convergence of series in a complete M can be used to show that if 
R is complete relative to the /-adic topology, then any b € J is quasi-invertible. For, since b” € 
I", b” — 0. Hence the series 


1+b+b?+°° 


converges in R. Since 


(1+b+-::+b""')(1—b) = 1-b" = (1—b)(1+b+: +b""'), 


we obtain on passing to the limit that (1 + b + b? +... )(1—b)=1=(1—-))1+6+06? +... ). 
Hence b is quasi-invertible with }°”, b” as quasi-inverse. Thus every element of the ideal J is 
quasi-invertible and hence 7 c rad R, the Jacobson radical of R. We record this result as 


PROPOSITION 7.30. Jf R is complete relative to the I-adic topology, then I C rad R. 


If we know in addition that R/J is semi-primitive, then / = rad R. For example, this will be 
the case if / is a maximal ideal in R. 

Another important property of complete rings relative to /-adic topologies is the following 
idempotent lifting property that is closely related to Proposition 7.14. We recall that 
Proposition 7.14 played an important role in the study of the topology of Spec R, the space of 
prime ideals in R. 


THEOREM 7.30. Let R be a ring that is complete relative to the I-adic topology defined 
by the ideal I and let ù= u + I be idempotent in R = R/T. Then there exists an idempotent e in 
R such that è = i. 


Proof. We assume first that R is commutative. In this case we can apply Proposition 7.14, 
including the uniqueness part to the ring R//"”, n = 1, 2, .... We have the ideal ///” in R//” and 


I/I” is nilpotent and (u + I")? = u + I"(mod 7/7”). It follows from Proposition 7.14 that there 
exists a unique element e,, + 7” in R/I” such that (e, + J")? =e, + I" and e, = u(mod J). Then e, 
=e,(mod 7”), e, = u(mod J), and if e’, is a second element satisfying these conditions, then e’, 
= e,(mod 7”). Since P” > P tk k=1,2, ..., we have e, +k = €„(mod 7”). Then {e,} is Cauchy, 


so {e,} converges to, say, e. We have e? = lime,” = lime, = e and u = e (mod J). 


Now let R be arbitrary. Let R, be the closure in R of the subring generated by u. Then R, is 
commutative and if /; =Z N R}, then /, is an ideal in R, such that /:"=°. Moreover, since R} is 
closed in R, R, is complete relative to the 7}-adic topology. We have (u + J,)* = u + 1}. The 
result now follows by applying the first part to the commutative ring R,;. D 


An important technique in commutative ring theory is to pass from a ring R to its 
completion. We proceed to give constructions of the completion of a ring R and of a module M 
relative to their /-adic topologies (assuming 7° = 0 and J® M = 0). Let C(M) denote the set of 
Cauchy sequences of elements of M. C(M) has a natural module structure, since the sum of two 
Cauchy sequences is a Cauchy sequence, and if {x,} is a Cauchy sequence and a € R, then 
{ax, } is a Cauchy sequence. C(M) contains the submodule of constant sequences {x} and the 
submodule Z(M) of null sequences {z,}, which are the sequences such that z, — 0. We can 
form the modules M = C(M)/Z(M) and R = C(R)/Z(R). If {a,} E C(R) and{x,} E€ C(M) , then 
{a,X,} E C(M) and this is contained in Z(M) if either {a,} € Z(R) or {x,} € Z(M). These 
facts are readily verified. It follows that Z(R) is an ideal in C(R) and hence R has the structure 
of a ring. Moreover, if å e Ê and x € M, we can define å< by taking representatives {a,}, 
{x,} of @, X and defining 4x = {a,x,} + Z(M). In this way M becomes an R-module. 

Since Z(M) contains no constant sequences except {0}, we can identify M with the set of 
cosets (with respect to Z(M)) of the constant sequences. In particular, we have an identification 
of R with a subring of Ê and M is an R-submodule of M. 

Now let Î denote the subset of Ê of elements {a,} + Z(R) such that every a, € I. Then I 
is an ideal in R and Î* is the set of elements {b,,} + Z(R) such that b, € 7. It follows that 
iv =0. Similarly, iv =0, so we have Hausdorff topologies in Ê and M defined by /. It is clear 
that the induced topologies on M and R are the /-adic topologies. Moreover, if {x,} + Z(M) is 
an element of M, then {x,} + Z(M) = lim x, in M. Hence M is dense in M. Moreover, M is 
complete. For, let {x,“} + Z(M), k = 1, 2, ..., be a Cauchy sequence in M. For each TADH 


th) y — th > 
we can choose x e M so that |i") +Z(M))-x™|< 1/2 


. It is easily seen that x = {x,} is a 
Cauchy sequence and lim({x,“)} + Z(M)) = x. Thus M is complete and, similarly, R is 
complete relative to the J -adic topologies. We call M and Ê the completions of M and R 
respectively. 

We shall now specialize these considerations to the most interesting case in which R is a 
commutative noetherian ring and M is a finitely generated R-module. With the notations as 


before, we prove first the following 


PROPOSITION 7.31. M = ÊM. 


Proof. Let {x}, ...,x,} be a set of generators for M as R-module and let Î be any element of 
M. Then Ÿ= lim y, where the y, € M. Evidently y, + 1 — Y, > 0, so if o(y, +1- Yn) = Sp, then 
S, — ©. Since M = ERx, IM = El"x, and hence we can write »+ı=» = 2-140) where a,j € Pe 


We now write y; =)" bx; where b;; € R. Then 
r 
Yn F Yı + (y2 =y pee (Yn — Yn- )= : b,j; 
j=1 


where 6,; = by + ayt ... + a, — y. The r sequences {b,j}, 1 <j <r, are Cauchy sequences. 
Hence lim, _, » {b,;} = 6; in Ê. Then ¥=limy, = Z b ,x;. Thus M = RM. O 


We have defined the ideal Î in R as the sets of cosets {a,} + Z(R) where the a, € I. 
Evidently Zc Î and hence RI c J. Since {a,} + Z(R) =lima,, it is clear that J is dense in J. 
The proof of the foregoing proposition shows that RJ = I. We consider next a submodule N of 
M. Since R is noetherian and M is finitely generated, N is also finitely generated. Evidently 
IPN c IPM =0. We have the /-adic topology of N and the induced topology in N as a subspace of 
M. We shall now prove 


PROPOSITION 7.32. Jf N is a submodule of the module M for the module M for the 
(commutative) noetherian ring R and I is an ideal in R, then the I-adic topology in N 
coincides with the topology induced by the I-adic topology of M. 


Proof. A base for the neighborhoods of 0 in the /-adic topology of N is the set {/”N} and a 
base for the neighborhoods of 0 in the induced topology is {M N N}. To show that the two 
topologies are identical, that is, have the same open sets, it suffices to show that given any PN 
there exists an/”M N N c I'N and given any M N N there exists an JIN such that IIN c PM 
N N. Since PN c I"M N N, the second is clear. To prove the first we use the Artin-Rees 
lemma. According to this there exists an integer k such that for any q > k, IM N N= 117 *UkM 
N N). Hence if we take q =n + k, we obtain “MON = MU'M AN) e I'N, O 


We shall prove next the important result that the completion of a noetherian ring relative to 
an J-adic topology is noetherian. The proof is based on the method of graded rings and 
modules that was introduced in section 7.16. Recall that we showed that if Z is an ideal in a 
noetherian ring R, then the associated graded ring GR) is noetherian and if R is noetherian and 
M is a finitely generated R-module, then the graded module GM) is a finitely generated 
G,R)-module (Proposition 7.29, p. 449). 


We shall need the following 


PROPOSITION 7.33. Let R be a noetherian ring, I an ideal in R such that I® = 0; Ê the 
completion of R relative to the I-adic topology. Let GKR) and G j (R) be the graded rings of 


Rand R associated with the ideals I and I respectively. Then G(R) = G(R) 


Proof. We have R;=T/I'*!,R,= 1'/1'*!. tis clear from the definition of Î that Î! N I! 
+1=7i+1_By the density of R in R, for any b e Îi, there exists a b € R such that b -be fit 
l Then b e FF = Ji A R. Hence Îi = F + Îi + | We now have 
R= PES s (PUNE S PEt T= TE = R, Tt follows from the definition of the multiplication 
in the graded rings that G48) = G(R), O 


We require also 


PROPOSITION 7.34. Let R be complete relative to an I-adic topology and let M be an R- 
module such that "M =0. Suppose that G{M) is finitely generated as G{R)-module. Then M 


is finitely generated as R-module. 


Proof. We choose a set of generators {'*1.---.* for GM) as G;(R)-module. We may assume 
that & = tl°"'M and ~wel"M, €1°"'M, We shall show that {x,, ...,x,,} generates M as R-module. 
We observe first that if # is a homogeneous element of G{M) of degree m, then we can write 
i= )4% where a; is a homogeneous element of G(R) of degree m — e; Now let u; be any 
element of M. Suppose that o(u,) = m1 > 0. Then ñ: =4:1+/""'MeM"* and ñ + 0. Hence there 
exist a); € ~ such that ® = L4%, Then = "—Lawiel™*'M and so o(uy) = m, > my. We repeat 
the argument with u, and obtain ax¢/™~* such that u- Eaxx:eI™*'M, Continuing in this way we 
obtain a sequence of integers m, < m, < m; < ..., a sequence of elements u, uz, .... where ug € 
mM, and for each i a sequence aj; az; .... where “€!"~" such that ~È asx: = "+1, Since R is 
complete, the infinite series >',a,; converge, say, >}; ap; = a; We claim that u; = J} ax; To see 
this we write 


| — Y ajx; = Uy -) [Yau 


= Ug+ 1 +3 ( y au jie. 
l= j 


+i 


Since J°M = 0, we have u; — ax; = 0 and u, = Jax; O 


If M is an ideal in R, then G({M) is an ideal in G(R). Hence we have the following 
consequence of the last result. 


COROLLARY. Let R be complete with respect to an I-adic topology and suppose that 
GR) is noetherian. Then R is noetherian. 


We can now prove our main result. 


THEOREM 7.31. Jf R is a commutative noetherian ring, I an ideal in R such that I° = 0, 
and R is the completion of R relative to the I-adic topology, then R is noetherian. 


Proof. By Proposition 7.29, G(R) is noetherian. Hence, by Proposition 7.33, G(R) is 
noetherian. Then R is noetherian by the Corollary. O 


The noetherian property of rings of power series over noetherian rings (Theorem 7.12, p. 
424) follows also from Theorem 7.31. Let R be noetherian and let Z be the ideal in R[x, ..., x,] 
of polynomials fx, ..., x,) such that f(0, ..., 0) = 0. Then /° = 0 and so we can form the 
completion of R[x), ..., x,.] relative to the /-adic topology. This is the ring R[[x, ..., x,]]. Since 
R[x), .--, X,] is noetherian by the Hilbert basis theorem, so is the ring R[[x), ..., x,]]. 


EXERCISES 
1. Let R be a ring and / an ideal in R such that 7° = 0. If m < n, then we have the canonical 


homomorphism @,,,, of R/I” — R/I”. These homomorphisms define an inverse limit 
lim R/I 


mn 


"as on p. 73. Show that this inverse limit is isomorphic to R. 


2. Show that if R is noetherian, then R is a flat R-module. 
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Field Theory 


In BAI we developed a substantial part of the theory of fields. In particular, we gave a rather 
thorough account of the Galois theory of equations and the relevant theory of finite dimensional 
extension fields. Some of the topics covered were splitting fields of a polynomial, separability, 
perfect fields, traces and norms, and finite fields. 

We now take up again the subject of fields and we begin with a construction and proof of 
uniqueness up to isomorphism of the algebraic closure of a field. After this we give an 
alternative derivation of the main results of finite Galois theory based on a general 
correspondence between the subfields of a field and certain rings of endomorphisms of its 
additive group. We also extend the finite theory to KrulPs Galois theory of infinite dimensional 
algebraic extension fields. An important supplement to the (finite) Galois theory is the study of 
crossed products, which is a classical tool for studying the Brauer group Br(F) associated with 
a field. We consider this in sections 8.4, 8.5, and 8.8. In sections 8.9 and 8.11 we study the 
structure of two important types of abelian extensions of a field: Kummer extensions and 
abelian p-extensions. These extensions can be described by certain abelian groups associated 
with the base field. In the case of the p-extensions, the groups are defined by rings of Witt 
vectors. These rings are of considerable interest aside from their application to p-extensions. 
The definition and elementary theory of rings of Witt vectors are discussed in section 8.10. The 
concept of normal closures and the structure of normal extensions are considered in section 
8.7. 

The first two thirds of the chapter is concerned with algebraic extension fields. After this 
we take up the structure of arbitrary extensions. We show that such extensions can be obtained 
in two stages, first, a purely transcendental one and then an algebraic one. Associated with this 
process are the concepts of transcendency base and transcendency degree. We consider also 
the more delicate analysis concerned with separating transcendency bases and a general notion 
of separability applicable to extensions that need not be algebraic. Derivations play an 
important role in these considerations. We show also that derivations can be used to develop a 
Galois theory for purely inseparable extension fields of exponent one. 


In section 8.13 we extend the concept of transcendency degree of fields to domains. In 
section 8.18 we consider tensor products of fields that need not be algebraic and in section 
8.19 we consider the concept of free composites of two field extensions of the same field. 


8.1 ALGEBRAIC CLOSURE OF A FIELD 


We recall that a field F is called algebraically closed if every monic polynomial f(x) of 
positive degree with coefficients in F has a root in F. Since r is a root, that is, f(r) = 0, if and 
only if x — r is a factor of f(x) in F[x], this is equivalent to the condition that the only 
irreducible polynomials in F[x] are the linear ones and hence also to the condition that every 
polynomial of F[x] of positive degree is a product of linear factors. Still another condition 
equivalent to the foregoing is that F has no proper algebraic extension field. For, if Æ is an 
extension field of F and a € E is algebraic over F, then [F(a) : F] is the degree of the minimum 
polynomial f(x) of a over F, and f(x) is monic and irreducible. Then a € F if and only if deg 
fix) = 1. Hence if E is algebraic over F and E ? F, then there exist irreducible monic 
polynomials in F[x] of degree > 2; hence F is not algebraically closed. Conversely, if F is not 
algebraically closed, then there exists a monic irreducible f(x) € F[x] with deg f(x) > 2. Then 
the field F[x]/(/(x)) is a proper algebraic extension of F. 

We recall that if E is an extension field of the field F, then the set of elements of £ that are 
algebraic over F constitute a subfield A of E/F (that is, a subfield of E containing F). Evidently 
E =A if and only if E is algebraic over F. At the other extreme, if A = F, then F is said to be 
algebraically closed in E. In any case A is algebraically closed in E, since any element of E 
that is algebraic over A is algebraic over F and so is contained in A. These results were 
proved in BAI, pp. 270-271, as special cases of theorems on integral dependence. We shall 
now give alternative proofs based on the dimensionality formula 


(1) [E:F] =[E:K][K:F] 


for E, an extension field of F, and K, an intermediate field. This was proved in BAI, p. 215, for 
finite dimensional extensions and the proof goes over without change in the general case. In 
fact, as in the finite case, one sees directly that if (u,) is a base for E/K and (vg) is a base for 


KIF, then (u,v) is a base for E/F. The dimensionality formula gives the following 


PROPOSITION 8.1. [E : F] < œ if and only if E = F(a,,..., a,) where a, is algebraic over F 
and every a; is algebraic over F(a),..., ;_ 1). 


The proof is clear. 

This proposition gives immediately the results that were stated on algebraic elements: If a 
and b € E are algebraic over F, then, of course, b is algebraic over F(a) so F(a, b) is finite 
dimensional. Then the elements a + b, ab, and a~! if a + 0 are algebraic. Hence the algebraic 


elements form a subfield A containing F. If c is algebraic over A, then evidently c is algebraic 
over a subfield F(ay,..., a,), a; E A. Then F(a),..., a,, C) is finite dimensional over F and c is 


algebraic over F, soc € A. 
We remark also that if the a; are algebraic, then 


(2) F (G3, 5045 0q) = Flisi Qyl. 


This is well known ifn = 1 (BAI, p. 214) and it follows by induction for any n. 
We prove next 


PROPOSITION 8.2. Jf E is an algebraically closed extension field of F, then the subfield 
A/F of elements of E that are algebraic over F is algebraically closed. 


Proof. Let f(x) be a monic polynomial of positive degree with coefficients in A. Then fx) 
has a root r in E and evidently r is algebraic over A. Hence r e A. Hence A is algebraically 
closed. O 


This result shows that if a field F has an algebraically closed extension field, then it has 
one that is algebraic over F. We shall now call an extension field E/F an algebraic closure of 
F if (1) E is algebraic over F and (2) E is algebraically closed. We proceed to prove the 
existence and uniqueness up to isomorphism of an algebraic closure of any field F. 

For countable F a straightforward argument is available to establish these results. We begin 
by enumerating the monic polynomials of positive degree as fix), hÆ), hx)... Evidently this 
can be done. We now define inductively a sequence of extension fields beginning with Fọ = F 
and letting F, be a splitting field over F;_ , of f(x). The construction of such splitting fields 
was given in BAI, p. 225. It is clear that every F; is countable, so we can realize all of these 
constructions in some large set S. Then we can take E = UF, in the set. Alternatively we can 
define E to be a direct limit of the fields F;. It is easily seen that E is an algebraic closure of F. 
We showed in BAI, p. 227, that if K| and K, are two splitting fields over F of f(x) e F[x], then 
there exists an isomorphism of K,/F' onto K,/F. This can be used to prove the isomorphism 
theorem for algebraic closures of a countable field by a simple inductive argument. 

The pattern of proof sketched above can be carried over to the general case by using 
transfinite induction. This is what was done by E. Steinitz, who first proved these results. 

There are several alternative proofs available that are based on Zorn’s lemma. We shall 
give one that makes use of the following result, which is of independent interest. 


PROPOSITION 8.3. Jf A is an algebraic extension of an infinite field F then the 
cardinality |A| = |F|. 


Proof. A is a disjoint union of finite sets, each of which is the set of roots in A of a monic 
irreducible polynomial of positive degree. Then |A| is the same as the cardinality of this 
collection of finite sets and hence the same as that of the set of corresponding polynomials. If 


we use the result that the product of two infinite cardinals is the larger of the two, we see that 
the cardinality of the set of monic polynomials of a fixed degree is |F| and hence of all monic 
polynomials is |F|. Hence, |4| = |F]. D 


If F is finite, the preceding argument shows that any algebraic extension of F is either finite 
or countable. We can now prove the existence of algebraic closures. 


THEOREM 8.1. Any field F has an algebraic closure. 


Proof. We first imbed F in a set S in which we have a lot of elbow room. Precisely, we 
assume that |S| > |F| if F is infinite and that S is uncountable if F is finite. We now define a set 
A whose elements are (E, +, -) where E is a subset of S containing F and +,: are binary 
compositions in E such that (E, + , +) is an algebraic extension field of F. We partially order A 
by declaring that (E, + ,-) > (E', + ',-') if E is an extension field of E’. By Zorn’s lemma there 
exists a maximal element (.E, +, +). Then F is an algebraic extension of F. We claim that Æ is 
algebraically closed. Otherwise we have a proper algebraic extension E’ = E(a) of E. Then |E’| 
< |S], so we can define an injective map of E” into S that is the identity on E and then we can 
transfer the addition and multiplication on Z’ to its image. This gives an element of A # (E, +, 
-), contrary to the maximality of (E, +, -). This contradiction shows that E is an algebraic 
closure of F. O 


Next we take up the question of uniqueness of algebraic closures. It is useful to generalize 
the concept of a splitting field of a polynomial to apply to sets of polynomials. If = {f} is a 
set of monic polynomials with coefficients in F, then an extension field E/F is called a 
splitting field over F of the set T if 1) every f € T is a product of linear factors in E[x] and 2) 
E is generated over F by the roots of the fe T. It is clear that if E is a splitting field over F of 
T, then no proper subfield of E/F is a splitting field of and if K is any intermediate field, then 
E is a splitting field over K of I. Since an algebraic closure E of F is algebraic, it is clear that 
E is a splitting field over F of the complete set of monic polynomials of positive degree in 
F[x]. The isomorphism theorem for algebraic closures will therefore be a consequence of a 
general result on isomorphism of splitting fields that we will now prove. Our starting point is 
the following result, which was proved in BAI, p. 227: 


Lety : a ~a be an isomorphism of a field F onto a field F, f(x) e F[x] be monic of positive 
degree, /(x) the corresponding polynomial in F[x] (under the isomorphism, which is 7 on F 
and sends x ~ x), and let E£ and E be splitting fields over F and F of f(x) and f(x) respectively. 
Then 7 can be extended to an isomorphism of E onto E. 


We shall now extend this to sets of polynomials: 


THEOREM 8.2. Let 7 : a ~ aā be an isomorphism of a field F onto a field F, T a set of 
monic polynomials f(x) e F[x], ë the corresponding set of polynomials f(x) € F[x], E and E 
splitting fields over F and F of T and i respectively. Then n can be extended to an 
isomorphism of E onto E. 


Proof. The proof is a straightforward application of Zorn’s lemma. We consider the set of 
extensions of 7 to monomorphisms of subfields of E/F into E/F and use Zorn’s lemma to 
obtain a maximal one. This must be defined on the whole of E, since otherwise we could get a 
larger one by applying the result quoted to one of the polynomials f(x) e T. Now if ¢ is a 
monomorphism of E into E such that éF = q, then it is clear that ¢(£) is a splitting field over F 
ofi. Hence éE) = E and é is an isomorphism of E onto E. O 


As we have observed, this result applies in particular to algebraic closures. If we take F = 
F and 7 = 1, we obtain 


THEOREM 8.3. Any two algebraic closures of afield F are isomorphic over F. 


From now on we shall appropriate the notation F for any determination of an algebraic 
closure of F. If A is any algebraic extension of F, its algebraic closure A is an algebraic 
extension of A, hence of F, and so A is an algebraic closure of F. Consequently, we have an 
isomorphism of A/F onto F/F. This maps A/F into a subfield of F/F. Thus we see that every 
algebraic extension A/F can be realized as a subfield of the algebraic closure F/F. 


EXERCISE 


1. Let E be an algebraic extension of a field F. A an algebraic closure of F. Show that E/F 
is isomorphic to a subfield of A/F. (Hint: Consider the algebraic closure A of A and note 
that this is an algebraic closure of F.) 


3.2 THE JACOBSON-BOURBAKI CORRESPONDENCE 


Let E be a field, F a subfield, and let End;F be the ring of linear transformations of E regarded 
as a vector space over F. We have the map 


(3) F ~> EndpE, 


which is injective since we can recover F from End;£ by taking EndgnapgE, which is the set of 
multiplications x ~ ax, a € F (pp. 205-206), so applying this set to 1 gives F. 
We now seek a characterization of the rings of endomorphisms EndpF for a fixed E that 


does not refer explicitly to the subfields F. We observe first that for any b e E we have the 
multiplication bg : x ~ bx, x € E, and this is contained in EndpE no matter what F we are 


considering. This is one of the ingredients of the characterization we shall give. The second 
one will be based on a topology of maps we shall now define. 


Let X and Y be arbitrary sets. Then the set Y of maps of X into Y can be endowed with the 


product set (of copies of Y) topology in which Y is given the discrete topology. A base for the 
open sets in this topology consists of the sets 0; y, where fis a map X > Y, {x|l <i < n} a 


finite subset of X, and 0, Gr 1e Yge) = fx), 1 <i <n). We shall call this topology the 


finite topology for Y* and for subsets regarded as subspaces with the induced topology. If X 
and Yare groups, the subset hom (X, Y) of homomorphisms of X into Yis a closed subset of Y. 
For, if fis in the closure of hom(X, Y) and x,, x, are arbitrary elements of X, then the open set 


0; TET contains a g € hom(X, Y). Since g(x;x2) = g(xı)e(x2), it follows that fx1x) = 
f(x x2) and since this holds for all x), x2, we see that fe hom(X, Y). In a similar fashion we 
see that if M and N are modules for a ring R (not necessarily commutative), then homg(X,Y) is a 
closed subset of Y*. The density theorem for completely reducible modules (p. 197) has a 


simple formulation in terms of the finite topology: If M is a completely reducible R-module, 
then the set Ry of multiplications aj : x ~ ax is dense in Endgng,w, that is, the closure of Ry 


is Endgng,w/. This formulation is clear from the definition of the topology. We shall now 


prove a result that includes a characterization of the rings End,F for E a field and F a subfield. 


THEOREM 8.4. Let E be a field, } the set of subfields of E,T the set of subrings L of the 
ring of endomorphisms of the additive group of E such that (1) L > Ep, the set of 


multiplications ag in E by the elements a of E, and (2) L is closed in the finite topology. 
Then the map F ~ End; is a bijective order-inverting map with order-inverting inverse of 
$ onto I. 


Proof. We have seen that the indicated map is injective, and F; > F; if and only if End, E c 
End „E. It remains to show that it is surjective. Let L be a ring of endomorphisms of the 
additive group of E, and regard E as L-module in the natural way. Since F is a field, Æ has no 
ideals # 0, E, and hence Æ is irreducible as E-module. Since L > Ey, E is irreducible also as 
L-module. The inclusion L > E; implies that End; E c EndpE. Since E is a commutative ring, 
End,;E = Eç. Hence End;F is a subring of Ep and so this has the form Fp, the set of 
multiplications in £ determined by the elements of a subring F of E. By Schur’s lemma, Fg is a 
division ring. Hence F is a subfield of E. By the density theorem, L is dense in Endgyg, gE = 
End;E. Since L is closed, L = End; for the subfield F. This proves the surjectivity of F ~ 
End;F onto I. Thus our map is bijective. D 


We have previously noted the inverse map from T to >’. The preceding proof gives this 
again in slightly different and somewhat more convenient form, namely, for given L e T the 
corresponding subfield F is the set of a € E such that 


(4) apl = lar 


for every le L. 

The ring of endomorphisms L = EndpE contains Eg and hence can be regarded as E-E- 
bimodule by defining al = agl, la = lag, l E L, a € E. Since la could also be interpreted as the 
image of a under /, we shall always write /(a) when the latter is intended. For the present we 
shall be interested only in the left action of E£ on L, that is, L as (left) vector space over £. 

We consider first the dimensionality [L : F] (as left vector space over E). Suppose that x, 

., X„ are elements of E that are linearly independent over F. Then there exist /; € L = End-E 
such that 


(5) li(x;) = Òi j | < I, j Sn. 


Hence if a; € E, then }/,a,/(x;)= a;, which implies that the /; are linearly independent elements 


of L over E. Evidently this implies that if [E : F] = œ, then [Z : F] = æ. On the other hand, if [E 
: F] < œ and (x,..., x,) is a base, then the /; such that (5) holds are uniquely determined. 


Moreover, if / e L and /(x;) = a,, then };;a;l;@œx;) = /(x;). Since the X; form a base for E/F and / 
and J'a,l; are linear transformations in E/F, this implies that / = }'aJ:. Thus (/),..., /,,) is a base 
for L over E. We have proved the following important result. 


PROPOSITION 8.4. Jf F is a subfield of the field E and L = End;E, then [F : F] < œ if and 
only if[L : E] < œ and in this case we have [E : F] = [L: E]. Moreover, if (x},..., X,) is a base 
for EIF, then the linear transformations l; such that 1{x;) = 0, 1 <i < n, form a base for L 
over E. 


ij? 


This result shows that in the correspondence between subfields of Æ and rings of 
endomorphisms of its additive group given in Theorem 8.4, the subfields of finite codimension 
in E ([E : F] < ©) and the subrings L such that [L : E] < œ are paired. We shall now show that 
in this case, condition (2) that L is closed is superfluous. This will follow from 


PROPOSITION 8.5. Any finite dimensional subspace V over E of the ring of 
endomorphisms of the additive group of E is closed in the finite topology. 


Proof. Letx €e E and consider the map x* : / ~ I(x) of V into E. This is E-linear, so x* € V*, 
the conjugate space of V/E. Let W be the subspace of V* spanned by the x*. Since x*(/) = I(x) = 
0 for all x implies that / = 0, W= V*. Hence if [V : E] =n, then we can choose x; e F,1<i< 


n, such that (x,*,..., x,,*) is a base for V*. Then there exists a base (/),..., /,) for V over E such 
that the relations /,(x;) = 0;, 1 < i < n, hold. Now let / be in the closure of V and let x be any 
element of E. Then there exist a; € E such that > ja,l(x;)= U(x;) for 1 <j < n and Y)ajl(x) = U(x). 
The first set of these equations and /(x;) = 0, give a; = /(x;) so the a; are independent of x. Then 
the last equation shows that / = X 'a;l; € V. Hence v is closed. 0 


Theorem 8.4 and Propositions 8.4 and 8.5 clearly imply the 


JACOBSON-BOURBAKI CORRESPONDENCE. Let F be a field, ® the set of subfields 
F of E of finite codimension in E, A the set of subrings L of the ring of endomorphisms of the 
additive group of E such that (1) L > Ep, the set of multiplications in E by the elements of E, 


and (2) L as left vector space over E is finite dimensional Then the map F ~ End;E is 


bijective and order-inverting with order-inverting inverse of ® onto A. Under this 
correspondence [E : F] = [EndpE : E] and if L is given in A, the corresponding subfield F of 


E is the set ofa € E such that a € 1=la,(or al = la) for allle L. 


EXERCISES 


1. Let E = (2/(p))(x) where p is a prime and x is an indeterminate. Let F = (2/(p)) 
(x?°)where e > 1. Let D; € EndpE be defined by Dx” = (")x”"~ 4,1 <i < pë — 1. Verify 
that if a € E, then 


j 
D ja, = y (D; ;a)Di 


i=0 


and D,D; = (^D; r tap =DD = 0+] 2 p Prove that (1 D pea Dio 


, İs a base for EndpE over E. 
2. Let [E : F] =n < œ and let (/;,... /,) be a base for End;-F, (xj,..., x,,) a base for E. Show 
that the matrix (/(x;)) € M,(E) is invertible. 


3. Extend the results of this section to division rings. 


3.3 FINITE GALOIS THEORY 


If K and E are fields, we denote the additive group of homomorphisms of the additive group 
(K, + ,0) into (E, + ,0) by hom (K, E) and if K and E have a common subfield F, then hom,({K, 


E) denotes the subset of hom(K, £) consisting of the F-linear maps of K into E. The additive 
group hom(K, F) has an E-K-bimodule structure obtained by defining af for a € E, fe hom(K, 
E) by (af)(x) = af(x), x € K, and fb for b e K by (fb)(x) = fbx). The subset hom,({K, F) is a 
submodule of the E-K-bimodule hom(K, F). In the special case, which we considered in the 
last section, in which K = E, we have End E and End;E, and thee are E-E-bimodules with the 
module compositions as left and right multiplications by the elements of Æp. The result we 
proved on the dimensionality of EndpE in the case in which [E : F] < œ can be extended to 
prove 


PROPOSITION 8.6. Let E and K be extension fields of the field F and assume that [K : F] 


=n<o. Then the dimensionality of hom,(K, E) as left vector space over E is [K : F] =n. 


Proof. The proof is identical with that of the special case: If (x),..., x,,) is a base for K/F, let 
l; be the linear map of K/F into E/F such that /(x;) = 6;, 1 <, j < n. Then as in the special case 
we see that (/),..., /,,) is a base for hom,(K, E) over £. O 


Let s be a ring homomorphism of K into F. The existence of such a homomorphism 
presupposes that the two fields have the same characteristic and then s is a monomorphism of 
K into E. Evidently s e hom(K,£). Moreover, if a, b e K, then s(ab) = s(a)s(b). In operator 
form this reads 


(6) sa = s(a)s 


for all ae K. If K and F have a common subfield F, then a ring homomorphism of K/F into E/F 
is a ring homomorphism of K into E, which is the identity on F. Then s e hom;,(K, E). 


We recall the Dedekind independence theorem for characters: distinct characters of a group 
into a field are linearly independent over the field (BAI, p. 291). A consequence of this is that 
distinct ring homomorphisms of K into F are linearly independent over F. We use this and 
Proposition 8.6 to prove 


PROPOSITION 8.7. Jf K and E are extension fields of the field F and [K : F| = n < œ, then 
there are at most n (ring) homomorphisms of K/F into E/F. 


Proof. If s1,..., S, are distinct homomorphisms of K/F into E/F, these are elements of 
hom,(K, F) that are linearly independent over E. Since the left dimensionality of hom,({K, E) 
over F is n, it follows that m <n. O 


Suppose that G is a group of automorphisms of E. Put 
(7) EG = {Y ajs;|a;e E, sie G}. 


Then EG contains Eg = El and formula (6) shows that EG is a subring of the ring of 


endomorphisms, End E, of the additive group of E, since it implies that (as)(bt) = as(b)st for 
the automorphisms s, t, and the elements a, b e E. If Gis finite, say, G = {s, = 1, 5y,..., Sp}, 


then the s; form a base for EG over E. Hence the dimensionality [EG : E] = n = |G|. Thus EG 


satisfies the conditions for the rings of endomorphisms in the Jacobson-Bourbaki 
correspondence. The corresponding subfield F is the set of a € E such that agl = lap for all l € 


EG. Since arb; = bgap, this set of conditions reduces to the finite set ags; = sjaz or as; = sa 
for all s; e G. By (6), these conditions are equivalent to S(a) = a, 1 < i < n. Hence the field 
associated with EG in the correspondence is 


(8) F =InvG, 


the subfield of E of elements fixed under every s; e G. We have 

(9) [E:F] = [EG: £] = |G| 
and 

(10) EG = End;E. 


Now let Gal E/F be the Galois group of E/F, that is, the group of automorphisms of F fixing 
every element of F. Evidently Gal E/F > G. Let t e Gal E/F. Then t e EndpE, so by(10),t = 


> "a8; a; € E, s;€ G. By the Dedekind independence theorem ¢ = s; for some i. Thus 


(11) Gal E/F = G. 
We shall require also the following result on subrings of EG containing Ep. 


PROPOSITION 8.8. Let G be a finite group of automorphisms in the field E. Then the 
subrings of EG containing Eç are the rings EH, H a subgroup of G. 


Proof. Clearly any EH is a subring of EG containing Ep = El. Conversely, let S be such a 


subring and let H = S N G. Clearly H is a submonoid of G. Since G is finite, His a subgroup. It 
is evident that EH c S. We claim that EH = S. Otherwise, we have an element a,s, + ays. +... 


+ aS; +... + a,s, E€ S such that no S, € H and every a; # 0. Evidently r > 1 and assuming r 


minimal, the proof of Dedekind’s theorem gives an element with the same properties and a 
smaller r. This contradiction proves that S = EH. 0 


We shall say that a subfield F of E is Galois in E, alternatively, E is Galois over F if there 
exists a group of automorphisms G of E such that Inv G = F. We can now state the main result 
on finite groups of automorphisms in a field. 


THEOREM 8.5. Let E be a field and let A be the set of finite groups of automorphisms of 
E, w the set of subfields of E that are Galois in E and have finite codimension in E. Then we 
have the map G ~ Inv G of A into w and the map F ~ GalE/F of y into A. These are inverses 
and are order-inverting. We have 


(12) |G| = [E: Inv G]. 


If F € wand K is a subfield of E containing F, then K € y. Moreover, Gal E/K is a subgroup 
of Gal E/F and Gal E/K is normal in Gal E/F if and only if K is Galois over F, in which case 


(13) Gal K/F = (Gal E/F)/(Gal E/K). 


Proof. It is clear that the maps G ~ Inv G, F ~ Gal E/F are order-inverting. Now G ~ Inv G 


is surjective, by definition of y, and since Gal E/ Inv G = G, it is bijective with inverse F ~ 
Gal E/F. Also we had (9), which is the same thing as (12). Now let F € yw and let K be a 
subfield of E containing F. Then F = Inv G for some finite group of automorphisms G of E. 
Since K > F, the corresponding ring End,F in the Jacobson-Bourbaki correspondence is a 


subring of EndpE = EG containing Ep. By Proposition 8 this has the form FH, for a subgroup H 


of G. Then K = Inv He y. Ifs e G, then sHs ~ ! is a subgroup of G and Inv sHs ~~ ! is s(K). 
Hence H is normal in G if and only if s(K) = K for every s e G. Then the restriction §:= s|K is 
an automorphism of K. The set of these is a finite group of automorphisms G of K. It is clear 
that F = Inv G. Hence if H is normal in G, then K is Galois over F. Moreover, in this case G = 
Gal K/F and s ~ §: is an epimorphism of G onto G. Evidently the kernel is Gal E/K. Hence we 
have the isomorphism (13). It remains to show that if K is Galois over F, then H is normal in G 
or, equivalently, s(K) = K for every s e G. If s(K) £ K, then s|K is a homomorphism of K/F into 
F/F, which is not an automorphism of K/F. On the other hand, the order of Gal K/F is [K : F]. 
This gives [K : F] + 1 distinct homomorphisms of K/F into E/F, contrary to Proposition 8.6. 
This completes the proof. 0 


It remains to determine the structure of finite dimensional Galois extension fields. For this 
we use an important addendum to the isomorphism theorem for splitting fields given in BAI, p. 
227, namely, that if Æ is a splitting field over F of a polynomial with distinct roots, then | Gal 
E/F |= [E : F]. It is easily seen that the hypothesis that f(x) has distinct roots can be replaced 
by the following: f(x) is separable, that is, its irreducible factors have distinct roots. If F’ = 
Inv (Gal E/F), then F’ > F and [E : E'] = |Gal E/F| = [E : F], which implies that F = F” is 
Galois in E. Conversely, let E be finite dimensional Galois over F. Let a € E and let {a, =a, 
..., a,} be the orbit of a under G = Gal E/F. Then the minimum polynomial of a over F is g(x) 
= [|(« — a), which is a product of distinct linear factors in E. Since ŒE is finite dimensional 
over F, it is generated by a finite number of elements a, b,.... Then it is clear that E is a 
splitting field over F of the polynomial, which is the product of the minimum polynomials of a, 
b,.... This polynomial is separable. Hence we have 


THEOREM 8.6. E is finite dimensional Galois over F if and only if E is a splitting field 
over F of a separable polynomial f(x) € F(x]. (Compare Theorem 4.7 on p. 238 of BAL.) 


Theorems 8.5 and 8.6 give the so-called fundamental theorem of Galois theory, the pairing 
between the subfields of E/F and the subgroups of G = Gal E/F (BAI, p. 239). 


EXERCISES 


1. Let E be Galois over F (not necessarily finite dimensional) and let K be a finite 
dimensional subfield of E/F. Show that any isomorphism of K/F into E/F can be 
extended to an automorphism of £. 


2. Let s be a ring homomorphism of K into E, so s e hom(K, E). Show that E, is a 


submodule of hom(K, E) as E-K-bimodule. Show that distinct ring homomorphisms of K 
into E give rise in this way to non-isomorphic E-K-bimodules and that any E-K- 
bimodule that is one-dimensional over E is isomorphic to an £,. 


3. Use the theory of completely reducible modules to prove the following result: If s4,..., S, 
are distinct ring homomorphisms of K into £, then any E-K-submodule of >),’Es; has the 
form >) Es;, where {s;} is a subset of {s;}. 


4. Let F be the algebraic closure of F and let E and F" be finite dimensional subfields of F 
/F such that E is Galois over F. Let E' be the subfield of F/F generated by E and F". 
Show that E’ is Galois over F” and the map g' ~ g'|E is an isomorphism of G’ = Gal 
E'/F' onto the subgroup Gal E/(E N F’) of G = Gal E/F (see Lemma 4, p. 254 of BAI). 
Show that EVF"=E® 79 pF'. 


3.4 CROSSED PRODUCTS AND THE BRAUER GROUP 


We shall now make an excursion into non-commutative algebra and give an important 
application of Galois theory to the study of the Brauer group of similarity classes of finite 
dimensional central simple algebras over a field F. We shall study a certain construction of 
central simple algebras called crossed products, which are defined by means of a Galois 
extension field E£ and a factor set of the Galois group into the multiplicative group E* of non- 
zero elements of Æ. Such crossed products in the case of cyclic field extensions are called 
cyclic algebras. These were introduced by Dickson in 1906 and were apparently inspired by 
an earlier construction of infinite dimensional division algebras that had been given by Hilbert 
in his Grundlagen der Geometrie (1899). The most general crossed products were studied by 
Dickson and his students in the 1920’s. The form of the theory that we shall present here is 
essentially due to Emmy Noether. 

We begin with a finite dimensional Galois extension field E/F with Galois group G. We 
form a vector space A over E witha base {u,|S € G} in 1 — 1 correspondence with the Galois 


group G. Thus the elements of A can be written in one and only one way in the form >, < G 
Pls, Ps E E. We shall introduce a multiplication in A that will render A an algebra over F. The 
defining relations for this multiplication are 


(14) usp = (Sp)us, Uti K, ithe 


where the k, , are non-zero elements of E and u, is 1 u,. More precisely, we define the product 
in A by 


(15) Y pss} { >, Ore |=) ks rps(Sor)Use 


seG teG 


where the k, , are non-zero elements of E. In particular, we have the product u,u, = k, u,, and 
hence for any s,t,v e G we have (u,uJu, = ky pusty = ky Kop Usp, and usum) = ulki Uy) = 


(Sk, )Ks p»Usey. Hence to insure associativity of the product defined by (15) we must have 
(16) ks 1Kst.v = (Sky )Ks.ws ds f; VE G. 


We observe that these conditions are exactly the conditions under which the map (s, £) ~ k, + of 


G x G into the multiplicative group E* of non-zero elements of E constitutes a 2-cocycle (p. 
361). Here the action of G on E* is the natural one where sp is the image of p under s € G. As 
in the case of group extensions, we call a 2-cocycle k : (s, £) ~ k, , a factor set of G in E*. 


From now on we assume that the k, , satisfy (16), so k is a factor set. Then it is straightforward 


to verify that (15) is an associative product. Moreover, distributivity with respect to addition 
holds. The factor set conditions (16) imply that 


(17) kism Ki i and ki = Skyy 


and these imply that 
(18) L= kī iu: 


is the unit for A with respect to multiplication. Finally, we note that with respect to the vector 
space structure of A relative to the subfield F of E we have the algebra conditions 


(19) a(ab) = (xa)b = a(b) 


fora e F,a,b € A. These follow directly from (15). Hence A is an algebra over F. We call A 
the crossed product of E and G with respect to the factor set k and we write A = (E, G, k) 
when we wish to display the ingredients in the definition. 

We now investigate the structure of crossed products and we prove first 


THEOREM 8.7. A= (E, G, k) is central simple over F, and [A : F] = n? where n = [E : F]. 
E( = El) is a subfield of A such that the centralizer C,(E) = E. 


Proof. We have the dimensionality relation [A : F] = [A : E] [E : F] = n?. We note next that 


since u,u, -1 = K,, , - 1K, 1 18 a non-zero element of E and the same is true of u _ \u,, u, iS 
$ S 


invertible in A. Now let B be an ideal in A and write a =a + B, a € A. Suppose that B + A, so A 


= A/B + 0. Then p ~ P is a monomorphism of F into A and the elements ñ, are invertible in 
A. We have the relations # = $P a. The Dedekind independence argument on shortest relations 


(BAI, P. 291) shows that the ž, are left linearly independent over B={ P lp € E}. Hence [A 


: E] =n and [A : E] [E : F] = n? = [A : F]. It follows that B = 0 and A is simple. Let p € E 


and suppose that >’p,“s commutes with p. Then >'(p — sp)p,u, = 0, which implies that p = sp for 
every s such that p, # 0. It follows that if $ p Ļu, commutes with every p € F, then p, = 0 for 
every s # 1. Then }’p,u, = pu, € E. This shows that C,(£) = E. Next suppose that c is in the 
center of A. Then C € E = C,(£) and cu, = u,c, which gives sc = c for every S e G. Then C e 
F. Hence the center is F, and A is central simple over F. O 


The fact that E is a subfield of A = (F, G, k) such that C,(£) = E implies that Æ is a splitting 
field for A: Ap = E pA = M,(E) (Theorem 4.8, p. 221). We recall that the similarity classes 
{A} of the finite dimensional central simple algebras A having a given extension field F of F as 
a splitting field constitute a subgroup Br(E/F) of the Brauer group Br(F) of the field F (p. 
228). The result we have proved implies that any factor set fe of the Galois group G into E* 
determines an element [(F, G, k)] of Br(E/F). Now consider the element [k] = kB?(G,E*) of 
H(G, E*) = Z(G, E*)/B*(G, E*) (p. 361). We claim that all of the factor sets in [k] determine 
the same element in Br(E/F). Ifk' € [k], then we have relations 


(20) Ks ne Us(SHi) Hst kin SFE G, 


where u, € E*. Let A' = (E, G, k') be the crossed product defined by k' and let (u's) be a base 
for A' so that u'p = (sp)u', and u',u',=k', M'se Now consider again the crossed product A = 
(E, G, k) with base (u,) over E as before. Put v, = uu, Then it is clear that the v, form a base 
for A over E. Moreover, we have vp = ull, = (spu, = (sp)vp and vV, = Ulgi = 
Ms (SPU = Ms(SU)K, pls |= Ks, t> Vse It is clear from these relations that the map >’ p,v, ~ 
>’, is an isomorphism of A onto A’. Thus A = A’ and hence A ~ A’ that is, [A] = [A’] in the 
Brauer group. 
We can now define a map 


(21) [k] > [(E, G,¥)] 


of H(G, E*) into Br (E/F). We shall show that this is an isomorphism. We prove first that (21) 
is surjective by proving 


THEOREM 8.8. Let A be a finite dimensional central simple algebra over F having the 
finite dimensional Galois extension field E/F as splitting field. Then there exists a factor set 
k such that A ~ (E, G, k). 


Proof. By Wedderburn’s theorem, 4 = M,(A) where A is a central division algebra. E is a 


splitting field for 4 if and only if E is a splitting field for A and the condition for this is that 
there exists an r’ such that E is a subfield of A' = M,(4) such that C,,(£) = E (Theorem 4.8, p. 


221). By Theorem 4.11 [A’: F] = [E : F] [C, (E) : F] =n’. Evidently A’ ~ A. We recall that by 
Theorem 4.9 (p. 222), any automorphism of E can be extended to an inner automorphism of A’. 


Accordingly, for any s € G there exists a u, € A’ such that sp = u,pu,~', p € E. Now lett € G 
and consider the element u,u,u/. It is clear that this commutes with every p € E. Hence u,uu,/! 
=k, ¿ E€ E* and u,u,=k, Mst The associativity conditions (u,u,)u,, = u,(u,u,) imply as before 
that k : (s, £) ~k, ¿is a factor set and so we can form the crossed product (E, G, k). It is clear 
that A” = {9 pup; E€ E} is a subalgebra of A’. We have a homomorphism of (E, G, k) onto this 
subalgebra. Since (E, G, k) is simple, this is an isomorphism. Thus 4” = (E, G, k) and hence 
[2 : F] =n? =[A': F]. Then A” = A. Thus A~A' = (E, G, k). 0 


We prove next the following basic multiplication formula for crossed products: 
THEOREM 8.9. (F, G, k) 8 p (E, G, D) ~ (E, G, kl). 


Proof. Put A= (E, G, k) B = (E, G, 1). In considering the tensor product 4 ® B we may assume 
that, A and B are subalgebras of an algebra AB in which every element of A commutes with 
every element of B, and if x;,..., x, E A (B) are linearly independent over F, then these 


elements are linearly independent over B (A) in the sense that >’ x;y; = 0 for y; € B (A) only if 
every y; = 0. We need to distinguish between the two copies of E contained in A and B and we 
denote these by £; and E, respectively. We have an isomorphism p ~ p; of E onto E; and we 
write (sp); for (sp); We have a base (u,) for A over E; and a base (v,) for B over E, such that 
Usp) = (sp uss UU, = k Usp V9 7 (SP2)Vs, Vee ly 12Vst- 

Since E/F is separable, it has a primitive element: EF = F(@). Let f(A) be the minimum 
polynomial of 0 over F. We have A4) =] |; < g A -— s0) in E[A] and hence f(A) = J [(4 — s@,) and 
fA) = [IQ — s0) in E [A] and £,[A] respectively. Consider the Lagrange interpolation 
polynomial 


fl) 
p (A—s0;) f'(s0;) 


flå) 
(22) 
= || (4-t0,)/ [| (s0, —t0:) 


in E [A]. Since 1 — eg f(A) is of degree < n -— 1 and this polynomial is 0 for the n distinct 
values s04, /—>f,(A)= 0 and 


(23) 7 A= 


seG 
Also ifs 4 t, then 


(24) WA) f(A)=9 (mod f(A)). 


Now consider the commutative subalgebra FE, of AB. Put 
(25) es = fe(02)€ Ey Ep. 


Then e, £ 0 since 1, 65,..., 9,”~ | are linearly independent over E}. By (23), we have >, < G € 
= 1, and by (24), e,e, = 0 if s + t. Hence multiplication of Ye, = 1 by e, gives ef = e, Thus we 
have 


(26) eh = 2020 Tete Jesi 


The definition of e, gives (8, — s0,)e, = 0 since (03 — s0 DI [r+ (82 — t81) = f(@z) = 0. It follows 
that for p) € E, we have 


(27) pres = (spi)es. 
Then EE; = @,£ Ee, = @ FE) E,e, and Ee, = E. As we have seen (p. 204), this decomposition 
of EE, as direct sum of simple algebras is unique . 


Now consider the inner automorphisms x ~ uxu; | and x ~ væv,” | in AB. These stabilize 
EE, and hence permute the simple components of £,£, and hence the idempotents e,. If we 


apply these automorphisms to the relations (27) and take into account that the first is the 
identity on B and the second is the identity on A, we see that 


pr(uresty ') = (tsps) (tresu; *) 


tp2(v,esv; |) = (spy) (viesty '). 


Comparison of these relations with (27) gives 


(28) Utes = Cts Uy 
Vies = est- 10}. 

Now put 

(29) est = CsUsUy 


Then e, , = e, and using (28), we see that the n? e, , constitute a set of matrix units: 


(30) Estse = 0r,5'@s,t' xr =|. 


Hence, by Theorem 4.6, AB = M,(C) where C is the centralizer in AB of the e, ,. Equally well, 
C = e; ;ABe, , and we proceed to calculate this algebra. To begin with, we know that [AB : F] 
= nf and hence [C : F] = n? and [e,, ; ABe; , : F] =n’. Now e; ;ABe, , contains Eye, ; = Eye 
and if we put 


(31) Ws = UsVs@y 1 
we have, by (28), that w, =e) ;U,V,, SO W, € e; ;ABe, 1. Now 


Ws(P1e1,1) = UsUsP 10) = SPyUsdsey = (sp1)ey.1Ws 
and 


Ws Wr = UsUsth Vie, = UslyVsV,) = Ks 11Usels 20s 
= ks rls 121 Wse = ks,rils,t1€1Ws 


= (ks, ls) Wst. 


It follows that e;  ;4Be; ı contains a subalgebra isomorphic to (E, G, kl). Comparison of 
dimensionalities shows that e; ;ABe, \= (E, G, kl). Hence (E, G, k) @p(E ,G, D~(E, G, kl). 0 


We now see that the map [k] ~ [(E, G, K)] is multiplicative from the group H(G, E*) onto 
the group Br(E/F). It follows that [1] ~ [1], which means that (E, G, 1) ~ 1. To complete the 
proof that [k] ~ [(E, G, k)] is an isomorphism, we require the “only if” half of the following 


THEOREM 8.10. (E, G, k) ~ 1 ifand only k~ 1. 


Proof. We have seen that k ~ 1 implies that (E, G, k) ~ 1. Conversely, suppose that (E, G, k) ~ 
1. Then we have an isomorphism a ~ a’ of (E, G, 1) onto (E, G, k). The image E” of E c (E, G, 
1) is a subfield of (E, G, k) and for every s e G = Gal E/F we have an invertible element v’, € 
(E, G, k) such that v' p' = (sp)'v',, p € E, v',v',=v',, The isomorphism p' ~ p of E' into E c (E, 
G, k) can be extended to an automorphism 7 of (E, G, k). This maps v’, into v, and we have v,p 
= (SP)V,, VV; = Vy; On the other hand, for every s e G we have a u, such that u,p = (sp)u, and 


1 


UU, = k commutes with every p € E and hence m,v,~ ! = u, € E and u, = 


ssr Then u,v, 


HsVs. Then k, = Us(Sip sy | so k~ 1. 0 
We re-state the main result we have proved as 
THEOREM 8.11. The map [k] ~ [(E, G, K] is an isomorphism of H(G, E*) onto Br(E/F). 


We have proved in Theorem 6.14 (p. 361) that if G is a finite group and A is a G-module, 


then [K]!©! = {1} for every [k] € H(G, A), n > 0. In particular, this holds for every [k] € H(G, 
E*). It follows from Theorem 8.11 that (E, G, k)'@! ~ 1 for every crossed product. There is an 
important improvement we can make in this result. We write A = (E, G, k) = M_(A) where A is 
a division algebra. We have [A : F] = œ and [A : F] = n? = rd’, so n = rd. The integer d is 
called the index of A. Since A is determined up to isomorphism by A, we can also call d the 
index of A. We now have, 


THEOREM 8.12. Ifd is the index of A=(E, G, k), then Af ~ 1. 


We shall base the proof on matrices of semi-linear transformations. Let S be an s-semi- 
linear transformation in a finite dimensional vector space V over a field E : S is additive and 
S(ax) = (sa)(Sx), a € F, xe V. Let (x1,..., x,) be a base for V/E and write Sx; = )1,, oja. 
Then the matrix ø = (a;;) is called the matrix of s relative to (X),...,X,). S is determined by a 
and s; for, if x = X éx; then Sx = di jf (SS)OjX%;- If T is a t-semi-linear transformation, then it is 
readily verified that ST is an st-semi-linear transformation whose matrix is o(st) where st = 
(ST;;). 

We can now give the 


Proof of Theorem 8.12. Since A = MA), A can be identified with Enda: V, the algebra of 


linear transformation in an r-dimensional vector space V over A' = A”, Since A = (E, G, k) 
contains the subfield E£, V can also be regarded as vector space over E and over F. We have [V 
: F] =[V: E][E : F] =[V: E]n and [V : F] =[V: A] [A': F] = ræ = nd. Hence [V : E] = d. Let 
(u,) be a base for A = (E, G, k) over E such that up = (Sp)u, and u,u, = k,,u,, The first of 
these relations shows that the element u, € End,, V is an s-semi-linear transformation of WE. 
Let (x1,..-, Xq) be a base for V/E and let M(s) denote the matrix of u, relative to (x1,..., Xg). 


Then the relation u,u, = k, 


(32) M(s)sM(t) = k, »M(st). 
Taking determinants we obtain 


(33) K stia = (sp); 


1 


{st gives the matrix relation 


where u, = det M(s). Since u,u,- ! is a non-zero element of E, M(s)sM(s~ ') is a non-zero 
scalar matrix. Hence M(s) is invertible and u, # 0. Then, by (33), we have ix = (su) | 


and so k4 ~ 1. Hence Af ~ 1 by Theorem 8.11. 0 


st 


There is one further question on Br(F, E) and H(G, E*) that we need to consider. We 
suppose we have E > L > F where L is Galois over F and H = Gal E/L. Then H a G and G = 
Gal L/F = G/H. More precisely, we have the canonical homomorphism s ~ §:= s|L of G onto G 


whose kernel is H. Hence G/H =G under sH ~ §:. Let A be a finite dimensional central simple 
algebra over F split by L. Then JA’ is also split by E. By the isomorphism given in Theorem 
8.11 we can associate with A’ an element k' of H?(G, L*) and an element k € H(G, E*). We 
shall obtain an explicit relation between [k'] and [A], namely, we shall show that if k' is (5; £) 
~ks.¢¢ L* CE*, then k is 


(s,t) > kez 


where s ~ §:, t ~f in the canonical homomorphism of G onto G. In view of the definition of the 
isomorphism of H?(G, E*) and Br(£/F), this is an immediate consequence of the following 


THEOREM 8.13 (H. Hasse). Let E be Galois over F with Galois group G, L a Galois 
subfield of E/F, H = Gal E/L, G=.Gal L/F. Let A = (L,G,k') be a crossed product of L and G 
with the factor set k'. Then A' ~ A = (E, G, k) where k,, = k's:f and s ~ &:, t ~ € in the 
canonical homomorphism of G onto G. 


Proof. (Artin, Nesbitt, and Thrall). We form A = A' ® M„(F) where m = [E : L]. Evidently A ~ 
A'. Since L is a subalgebra of A', we have the subalgebra L ® M,, (F) = M,,(L) of A. 

We now consider Æ as vector space over L proceed to define certain semi-linear 
transformations in L/L. First, we have the linear transformations pp : x ~ px for p € E. Next, 


we have the automorphisms s € G. Since s(¢x) = (s (sx) = (§ & (sx), Ge L,x € E, s is an š- 
semi-linear transformation of E/L. Moreover, we have the relation spg = (sp)gs (cf. (6)). We 


now choose a base (x1,..., Xn) for E/Z and consider the matrices in M,,(L) relative to this base 
of the various semi-linear transformations we have defined. Let ii(p) denote the matrix of pr 
relative to (x),..., Xm). Then p ~ “(p) is a monomorphism of E, so the set of matrices p(£) is a 
subfield of the algebra M,(L)/L isomorphic to E/L. Let t, denote the matrix of the s-semi- 
linear transformation s relative to (x1,..., Xm). Then we have the matrix relation 


(34) Ts = T(t) 
and the relation sp; = (sp) zs gives the matrix relation 


(35) TsSuU(p) = pL(sp)ts. 


Now let vs, §:€ G, be a base for A over L such that v; €, = (5 &)vg,, č € L, and Vs: Vi = k's 7 
pv Put u, = t,vg, E A (since t, E€ M,(L) c A and A’ c A). Then uyu(p) = 
temp) = t.5n(p)0 = wispts (by (35)). Thus 


(36) usu(p) = p(sp)us. 


uy = Ta Pzt; Or = 1, (81, )CzUr = takir (by (34)) = kis Tyla = (Ki tse. Hence 


(37) Us; = [A K5.r)Usr 


The relations (36) and (37) imply that we have a homomorphism into A of (E, G, k) where k, , 
= k's ;. Since (E, G, k) is simple, this is a monomorphism and since [(E, G, k) : F] = [E : F}? 
and [A : F] = [L : Fm? =[L: FP [E : L =[E: FY, we have A= (E, G, k). D 


EXERCISES 


1. Show that if L is a subfield of the Galois field E/F, then (E, G, k), = (E, H, k') (over L) 
where H = Gal E/L and k' is obtained by restriction of k to H x H. 

2. Let F be the algebraic closure of F and let E and F' be finite dimensional subfields of F 
/F such that E/F is Galois. Let E' be the subfield of F/F generated by E and F". Let G = 
Gal E/F, G' = Gal E'/F' = Gal E'/F’ so we have a canonical isomorphism g ~ g' = g\E 
of G' with a subgroup of G (Exercise 4, p. 475). Show that (E, G, bp ~ (E', @', k') 
where k' is obtained by restricting k to G' (identified with the subgroup of G). (Hint: 
Consider the case in which F N E = F. Combine this with exercise 1 to obtain the 
general case.) 


3.5 CYCLIC ALGEBRAS 


The simplest type of crossed product A = (E, G, k) is that in which Æ is a cyclic extension field 
of F, that is, E is Galois over F with cyclic Galois group G. Let s be a generator of G and let u 
=u,. Then (u, lu centralizes E ; hence, u = uu, where u; € E*. We can replace u,. by u; 0 < 
i <n = [G: 1]. This replacement replaces the factor set k by k' where 


1 if0<i+1<n, 
(38) gai . Lo 
y fn <it+j <2n-2. 


The crossed product A is generated by E£ and u and every element of A can be written in one 
and only one way as 


(39) Potpiut +py- yu" |, pic E. 


The multiplication in A is determined by the relations 


(40) up =(spu, uu" =». 


Since u” commutes with u and with every element of E*, u” is in the center F of A. Hence y € 


F*. We shall now denote A = (E, G, k) by (E, s, y) and call this the cyclic algebra defined by 
E/F, the generator s of Gand y e F*. 

We shall now specialize the results on Br (E/F) for E/F Galois to the case in which E/F is 
cyclic. In this case we need to consider only factor sets of the form (38). We call such a factor 
set a normalized factor set defined by y. It is readily seen that the normalized factor set 
defined by y and the normalized factor set defined by ô (€ F*) are cohomologous if and only if 
ô = Nz AU), u E€ E*. Accordingly, we are led to consider the group F*/N(E*) where F* is the 


multiplicative group of non-zero elements of F and N(E*) is the subgroup of elements of the 
form Ny), u € E*. Then it is clear that we have an isomorphism of F*/N(E*) onto H?(GE*) 


sending an element yM(E*) of the first group into the class in H(G, E*) of the normalized 
factor set defined by y. If we take into account the isomorphism given in Theorem 8.11, we 
obtain the following result in the cyclic case: 


THEOREM 8.14. The map yN(E*) ~ [(E, s, y)] is an isomorphism of F*/N(E*), onto 
Br(E/F). 


It is readily seen also that Theorem 8.13 implies the following result for E/F cyclic. 


THEOREM 8.15. Let E be a cyclic extension field of F and let L be a subfield of E/F. Let 


§:=s|L where s is a generator of the Galois group of EIF. Then (L, §:, y) ~ (F, s, y”) where m 
=[F:L]. 


We leave the proof to the reader. 


EXERCISES 


1. Show that (£,s y) ~ 1 if and only if y is a norm in Æ, that is, there exists a c e E such that 
y = Nejc). 
2. Show that (E, s, y) 8p (E, s, y) ~ (E, s, yd). 


3. (Wedderburn.) Prove that (E, s, y) is a division algebra if n = [E : F] is the smallest 
positive integer m such that y” = Ng;p(c) for some c € E. 


4. Let Ep be a cyclic extension of Fp of dimension n. For example, we can take Fo to be 
finite with q elements and Ey > Fo such that [Eù : Fo] = n. Let sọ be a generator of the 
Galois group of Ep/Fo. Let E = E,(t) be the field of rational expressions over Ep in an 
indeterminate ¢ and let s be the automorphism in E extending sọ and fixing t. Show that if 
F = F(t), then E/F is cyclic with Galois group “>. Show that (E, s, t) is a division 
algebra. 


5. Specialize exercise 2, p. 484, to show that if the notations are as in this exercise and E/F 


is cyclic and G = «>, then (E, s, y)p ~ (E, s”, y) where G’ = >. 


8.6 INFINITE GALOIS THEORY 


In this section we shall give an extension of the subfield-subgroup correspondence of the finite 
Galois theory to certain infinite algebraic extensions of a field. 

We observe first that if £ is algebraic over F, then any homomorphism s of E into itself that 
fixes the elements of F is an automorphism of E/F. Since any homomorphism of a field into a 
non-zero ring is injective, it suffices to show that s is surjective. To see this let a € E and let 
fix) be the minimum polynomial of a. Let R = {a = a)..., a;} be the set of roots of f(x) in E. 


Then R is stabilized by s and since s|R is injective, s|R is surjective. Hence a = sa; for some j. 
We use this result to prove 


PROPOSITION 8.9. Jf E is algebraic over F then G = Gal E/F is a closed set in the finite 
topology of E®. 


Proof. Lets € G, the closure of G in EF. It is clear from the definition of the topology that s is 
a homomorphism of E into itself, fixing the elements of F. Then s is an automorphism and so s 
eG.0 


It is apparent from this result that we must restrict our attention to closed subgroups of 
automorphisms if we wish to obtain a 1 — 1 correspondence between groups of automorphisms 
and intermediate fields of an algebraic extension. That this is a real restriction can be seen in 
the following example. 


EXAMPLE 


Let F, = Z/(p), the field of p elements, and let F, be the algebraic closure of F,. Since F, is 
algebraic over F,„, the map z: a ~ a? in F p iS an automorphism. We shall show that <*> is not 
closed in Gal F/F. We need to recall some facts from the theory of finite fields (BAI, pp. 287 
— 290). First, every finite field of characteristic p has cardinality p” and for any p” there exists 
a field Fp, with |F n| = p”. This field is a splitting field over F, of x?" — x and is unique up to 
isomorphism. Moreover, every element of is a root of this polynomial. The subfields of F,» are 
the with mln. Since F p is the algebraic closure of F,, it contains every FSince Fin O F,” c 
Fpm it is clear that any finite subset of F, is contained in one of the subfields F,,”. It is clear 
also that any automorphism of F p'Fp stabilizes every finite subfield Fn. The Galois group of 
F’n/F,, consists of the powers of the map a ~ a”. It follows that if s is any automorphism of F 
p' Fp» then the restriction of s to any F,» coincides with the restriction of a suitable power of z 


to Fn. This implies that the closure <*> contains every automorphism of F p' Fp. Since the latter 
is closed, we have <"> = Gal F,/F,. We shall now show that «1> # «) by producing an 
automorphism of F „/F, that is not a power of z. For this purpose we choose any infinite proper 
subfield K of F,. For example, we can take a prime q and let K be the union of the subfields 
Fm for m = q”, r = 1, 2,.... Since this set of subfields is (totally) ordered, their union is a 
subfield and it is clear that it is an infinite subfield of F p: Moreover, it is a proper subfield, 
since it contains no F,» where n is not a power of q. Now let K be any proper infinite subfield 
of F, and leta € F,, ¢ K. Let f(x) be the minimum polynomial of a over K. Then deg f(x) > 0 
and f(x) has a root b # a in F,. Since F, is a splitting field over K(a) and over K(b) of the set 
of polynomials {x?” — x/n = 1, 2,...}, it follows from Theorem 8.2 that there exists an 
automorphism s of Fp/K sending a ~ b. Thus s + 1 and the subfield of s-fixed elements contains 
K and so is infinite. On the other hand, if k > 1, the set of fixed points under 7“ is the finite set 


of solutions of x?” = x. It follows that s + n% for any k, sos ¢ “>. 


The infinite Galois theory of automorphisms of fields is concerned with splitting fields of 
separable polynomials. Let F be a field, T a set of separable monic polynomials with 
coefficients in E, and let E be a splitting field over F of T. We have 


PROPOSITION 8.10. Any finite subset of E is contained in a subfield L/F that is finite 
dimensional Galois over F. 


Proof. Let I" be the set of products of the polynomials in T. If F e I", the subfield Ly over F 


generated by the roots of fin E is a splitting field over F of f. By Theorem 8.6 this is finite 
dimensional Galois over F. Evidently Ly, > Ly U Lg. Hence E’ = Ur e I” Lris a subfield of E 


and since every f € T'is a product of linear factors in E"[x], we have E' = E. Thus F is a union 
of finite dimensional Galois fields over Æ. Evidently this implies our result. O 


We prove next 


PROPOSITION 8.11. Let k be a subfield of E/F that is Galois over F Then any 
automorphism of E/F stabilizes K. Moreover, the map s ~ s|K of Gal E/F into Gal K/F is 
surjective. 


Proof. Leta € K and let f(x) be the minimum polynomial of a over F. If s e Gal K/F, then 
f(s(a)) = 0, so s(a) is one of the finite set of roots of f(x) in E. Hence the orbit of a under Gal 


KIF is a finite set {a = a,..., a,}. It follows that f(x) = ] j"; (x — a,) and if t e Gal E/F, then 
t(a) is one of the a;. Hence t(a) € K. This proves the first assertion. To prove the second, we 


observe that Æ is a splitting field over K of I. Hence by Theorem 8.2, any automorphism of 
KIF can be extended to an automorphism of E/F. This proves the second assertion. O 


We can now prove the main theorem of infinite Galois theory. 


THEOREM 8.16 (Krull). Let E be a splitting field over F of a set I of monic separable 
polynomials and let G = Gal E/F. Let a be the set of closed subgroups of G, E the set of 
subfields of E/F. Then we have the map H ~ Inv H of ^ into } and K Gal E/K of } into A. 
These are inverses and are order-inverting. A subgroup H € n is normal in G if and only if K 
= Inv His Galois over F and if this is the case, then Gal K/F = (Gal E/F)/(Gal E/K). 


Proof. Itis clear that if H € a, then Inv He > and if K e e, then Gal E/K is closed and hence 
is in A. It is clear also that the indicated maps are order-inverting. Now let a € E, ¢ F, and let 
L be a subfield containing a that is finite dimensional Galois over F. Then we have an 
automorphism of L/F that moves a. Hence, by Proposition 8.11, we have an automorphism of 
E/F that moves a. This shows that Inv G = F. Since we can replace F by any subfield K of E/F 
and G by Gal E/K, we see that Inv(Gal E/K) = K. Now let H be a closed subgroup and let K = 
Inv H. Let L/K be a finite dimensional Galois subfield of E/K. By Proposition 8.11 (with K 
replacing F and L replacing K), Gal E/K and H c Gal E/K stabilize L and the set of 
restrictions of the s e Gal E/K to L constitute Gal L/K. The set H of restrictions of the A € H to 
L constitute a subgroup of Gal L/K. Hence, if this subgroup is proper, by the finite Galois 
theory there exists a č € L, ¢ K such that h(é) = é, h e H. This contradicts the definition of K = 
Inv H. Hence h = Gal L/K. Thus for any s e Gal E/K there exists an h €e H such that s|L = t|L. 
By Proposition 8.10, this implies that s is in the closure of H. Hence s e H. Thus Gal E/Inv H 
= H. This proves that the two maps are inverses. Now if K = Inv H and s € G, then s(K) = 
Invs(H)s ~ !. This implies that H < G if and only if s(K) = K for every s e G. In this case K is 
Galois over F, since the set of automorphisms s|K is a group of automorphisms in K whose set 
of fixed elements is F. As in the finite Galois theory, it follows that Gal K/F = (Gal E/F)/(Gal 
E/K). Finally if K is any subfield of E/F that is Galois over F, then any automorphism of E/F 
stabilizes K, so s(K) = K for s e G. Then H = Gal E/K is normal in G. O 


In the applications of infinite Galois theory it is useful to view the topology of the Galois 
group in a slightly different fashion, namely, as inverse limit of finite groups. If L/F is a finite 
dimensional Galois subfield of E/F, then H = Gal E/L is a normal subgroup of finite index in G 
= Gal E/F since G/H = Gal L/F, which is a finite group. Conversely, any normal subgroup of 
finite index is obtained in this way. Proposition 8.10 shows that E is the union of the subfields 
L such that L/F is finite dimensional Galois over F. It follows that {\H = 1 for the set H of 
normal subgroups of finite index in G. It is easily seen that G is the inverse limit of the finite 
groups G/H and that the topology in G is that of the inverse limit of finite sets. It is easily seen 
also from this or by using the Tychonov theorem that G is a compact set. 


EXERCISES 
1. Prove the last two statements made above. 


2. Let F be Galois in E and assume that Gal E/F is compact. Show that E/F is algebraic and 
E is a splitting field over F of a set of separable polynomials. (Hence Krull’s theorem is 
applicable to E/F.) 


3. Show that Gal F pF, has no elements of finite order # 1. 


4. Let E = F(t), t,...), the field of fractions of Flt), to, ...], the polynomial ring in an 
infinite number of indeterminates. Show that Gal E/F is not closed in the finite topology. 


5. Show that Gal F/F, = Gal F q/Fq for any two primes p and q. 


6. A Steinitz number is a formal product N= []P¥ over all of the primes p; € Z where k; = 
0, 1,2,... or œ. If M = JIP, we write M|N if l; < k; for all i. Note that the set of Steinitz 


poca 
numbers is a complete lattice relative to the partial order M < N if MIN. Call the sup in 
the lattice the least common multiple (1.c.m.) of the Steinitz numbers. If E is a subfield 


of F,,, define deg E to be the Steinitz number that is the 1.c.m. of the degrees of the 
minimum polynomials over F, of the elements of Æ. Show that this gives a 1 — 1 
correspondence between subfields of F p and Steinitz numbers. 


7. Show that Gal F ED is uncountable. 


3.7 SEPARABILITY AND NORMALITY 


We shall now investigate the structure of algebraic extension fields of a given field F. Most of 
what we shall do becomes trivial in the characteristic 0 case but is important for fields of 
prime characteristic. We operate in an algebraic closure F of F and consider its subfields. This 
amounts to looking at all algebraic extensions E of F, since any such extension is isomorphic to 
a subfield of F/F. It will be clear that everything we do is independent of the imbedding of E in 
F and we shall not call attention to this fact in our discussion. We remark also that F is an 
algebraic closure for any of its subfields E/F. 

Let I be a set of monic polynomials with coefficients in £ and let E be the subfield of F/F 
generated by the roots in F of every f(x) e T. Clearly £ is a splitting field over E of the set I. It 
is clear also that any homomorphism of E/F into F/F stabilizes the set of roots of every f(x) 
and hence stabilizes E. Consequently it is an automorphism of E/F. We shall now call an 
algebraic extension field E/F normal if any irreducible polynomial in F[x] having a root in EF is 
a product of linear factors in E[x]. It is evident from this definition that a normal extension is a 
splitting field, namely, the splitting field over E of the set of minimum polynomials of its 
elements. The following result therefore gives an abstract characterization of splitting fields as 
normal extensions. 


THEOREM 8.17. Jf E is a splitting field over F of a set IT of monic polynomials with 
coefficients in F, then E is normal over F. 


Proof. Let f(x) e F[x] be irreducible and have a root r in E. We have f(x) = [],"(x — 7,) in F 
[x] where rų = r, and we have to show that every r; e E.Consider E(7;). This field contains 


F(r,) and is a splitting field over F(r;) of the set I. Since r; and r; for any i are roots of the 
same irreducible polynomial in F[x], we have an isomorphism of F(r)/F onto F(r;)/F sending r 
» r; Since E(r) and E(r;) are splitting fields over F(r) and F(r;) of the set I, Theorem 8.2 
shows that we have an isomorphism of F(r) onto F(r;) extending the isomorphism of F(r) onto 
F(r,). Since Æ is a splitting field over F of T, this isomorphism stabilizes Æ. Since r e E, we 
have E(r) = E. Hence F(r;) = E and r; € E for every i. O 


If E is an arbitrary subfield of F/F, we can form the splitting field N in F of the set of 
minimum polynomials of the elements of E. Evidently N > E and N is normal over F. It is clear 
also that N is the smallest normal subfield of F containing E£. We call N the normal closure of 
E. 

We recall that an algebraic element is called separable if its minimum polynomial over F is 
separable, and an algebraic extension E/F' is separable if every element of E is separable 
(BAI, p. 238). We remark that if a € E is separable over F, then it is separable over any 
intermediate field K since its minimum polynomial over K is a factor of its minimum 
polynomial over F. If E is Galois over F, then it is separable and normal over F. For, ifa € E, 
the orbit under Gal E/F is finite, and if this is {a, = a, d»,..., a,.} then the minimum polynomial 
of a over F is | [i (x — a,). Since this is a product of distinct linear factors in E[x], a is 
separable. Hence E is separable and normal over F. 

Now let SF be the subset of elements of F that are separable over F. It is clear that SF is 
contained in the splitting field over F of all the separable monic polynomials in F[x]. By 
Theorem 8.16, the latter field is Galois over F and hence it is separable and normal over F. 
Then SF contains this field and hence SF coincides with the splitting field over F of all 
separable monic polynomials in F[x]. We shall call SF the separable algebraic closure of F. 

Now let a e F be separable over SF and let f(x) be its minimum polynomial over SF. Then 
f(x) has distinct roots in F and so (f(x), fx) = 1 if f(x) is the derivative of f(x) (BAL p. 230). 
Now apply G = Gal SF/F to f(x) in the obvious way. This gives a finite number of distinct 
irreducible polynomials f(x), 1 < i, < r, fix) =f() in SF[x]. We have (f(x), f()) = 1 and 
(F(x), G) 1 ifi #7. Then g(x) =] J" fix) € Fix] and (g(x), g'(x)) = 1, so g(x) is separable. 
Since g(a) = 0, we see that a is separable over F and soae SF. 

We can use these results to prove 


THEOREM 8.18. Jf E is an algebraic extension of F, then the subset of elements of E that 
are separable over F form a subfield SE of E/F. Any element of E that is separable over SE 
is contained in SE. 


Proof. The first statement is clear since SE = SF N E. The second is immediate also since if a 
€ Eis separable over SE, then it is separable over SF > SE. Hence ae SF N E = SE. O 


An algebraic extension E/F is purely inseparable over F if SE = F and an element a € E is 
purely inseparable over F if SF(a) = F. The second part of the last theorem shows that E is 


purely inseparable over SE. If the characteristic is 0, then E/F is purely inseparable if and only 
if E = F and a is purely inseparable if and only if a e F. The interesting case for these 
considerations is that in which the characteristic is p # 0. For the remainder of the section we 
shall assume that we are in this situation. We proceed to derive some useful criteria for an 
element of a to be separable or to be purely inseparable. We prove first 


PROPOSITION 8.12. Ifae E, then a is separable if and only if F(a) = F(a?) = F(a’) = 


Proof. Ifa is not separable, then its minimum polynomial is of the form f(x) = g(x”) (BAI, p. 
231). Then g(x) is irreducible, so this is the minimum polynomial of aP. Since [F(a) : F] = deg 
f(x) and [F(aP) : F] = deg g(x), we have [F(a) : F(a’)] = p and F(a) + F(a’). Next assume that 
a is separable. Then the minimum polynomial f(x) of a over F has distinct roots and hence so 
has the minimum polynomial A(x) of a over F(a’). Now a is a root of x? — ae F(a?)[x] and x? 
—aP = (x — a). Then h(x)? — a? and since h(x) has distinct roots, it follows that h(x) = x — a. 
Then a € F(a’). Taking pth powers we see that aP € F(a’), so a € F(a’). Iteration gives the 
required relations F(a) = F(a’) = FP) =....0 


We prove next the following criterion for purely inseparable elements. 


PROPOSITION 8.13. Jf a is purely inseparable over F, then its minimum polynomial over 


F has the form x” — b. On the other hand, if a is a root of a polynomial of the form x” — b e 
F[x], then a is purely inseparable over F. 


Proof. Let f(x) be the minimum polynomial of a and let e > 0 be the largest integer such that 
f(x) is a polynomial in x". Write f(x) = g(x”", g(x) € F[x]. Then g(x) is irreducible in F[x] and 
g(x) is not a polynomial in x?. Then g(x) is the minimum polynomial of b = a” and b is 
separable over F. Moreover, x” — b is the minimum polynomial of a over F(b). Hence if a is 
purely inseparable over F, then b = a” eF, F(b) = F, and x” — b is the minimum polynomial of 
a over F. Next assume that a is a root of x” — b, b € F. Then the formula for the p°th power of 


a sum implies that any c € F(a) is a root of a polynomial of the form x” — d= (x — c) with d 
€ F. Then the minimum polynomial of c over F has multiple roots unless it is linear. Hence if c 
€ F(a) is separable, then c e F. Thus a is purely inseparable. O 


We shall now look more closely at the structure of a normal algebraic extension in the 
characteristic p case. Let E/F be normal and let a € E. Write the minimum polynomial f(x) of a 


over F as g(x?) where g'(x) # 0. Since g(x) is irreducible in F[x] and has the root a e E, 
g(x) =]; @ — 5), b= aP“ in F[x]. The b’ are distinct since g'(x) + 0. Now f(x) = g(x?) = 
[],’@ — b) is irreducible in F[x] and has the root a in F. Hence f(x) = []"\(« — a) a, = a, in 


E[x]. It now follows that every x” — b, has a root in F and we may assume that this is Then P 
-b= (x= ap and f(x) = [],/(«- a; = hP" where h(x) = [],/@ — a) € Efix]. The relation 


f(x) = h(x)" shows that the coefficients of A(x) satisfy equations of the form xP =c e F. Hence 
these are purely inseparable elements of E. Now it is clear that the subset of purely 
inseparable elements is a subfield P/F and the coefficients of A(x) are contained in P. 


Moreover, a is a root of A(x) and A(x) = ] [œŒ — a,) with distinct a;. Hence a is separable over 
P. Since a was arbitrary in E, we have proved 


PROPOSITION 8.14. IfE is normal over F, then E is a separable extension of its subfield 
P of purely inseparable elements. 


This result is striking in that it shows that whereas any algebraic extension is built up by 
first making a separable extension and following this with a purely inseparable one, the order 
can be reversed for normal extensions. We now determine the structure of arbitrary extension 
fields that can be constructed in the second manner. 


THEOREM 8.19. (1) Jf E is an algebraic extension field of F such that E is separable over 
its subfield P of purely inseparable elements, then E = P ® pS where s is the maximal 


separable subfield of E. Conversely, if P is a purely inseparable extension of F and s is a 
separable algebraic extension of F, then P ®,S is an algebraic extension field of F whose 


subfield of separable elements is s and subfield of purely inseparable elements is P. 
(2) If E is normal algebraic over E then E = P ®p S as in part (1), with s Galois over F. 


Conversely, if P is purely inseparable over F and s is algebraic and Galois over F, then P ® 
pS is a normal algebraic extension field of F. 


Proof. (1) Assume first that [E : F] < œ. If a is any element of E and f(x) is its minimum 
polynomial, we have seen that f(x) = g(x?) where g(x) is separable. Then a” € S. It follows 
that for any finite subset {aj,..., a,.} in E there exists an e such that ajp° € s for all i. Since the 
elements having this property form a subfield and since F is finitely generated, there exists an e 
such that a? € S for every a € E. Now let (xj,...,x,) be a base for E/P and write 5 Fea ie 
1 Cikk, Where the cj, € P. Then yy; = didi, for yj = XP iV = Cp. The y; € Sand the dj, 
e SN P= F. The multiplication table for the y; shows that }’,” Fy; is an F-subalgebra of E and 
$ "Fy; is a P-subalgebra of F. Now let a € E and write a = ¥ "ax, a; € P. Then a = 
yi? awe Since a is separable over p, a € P(aP^) = Pia’’]. This and the formula for aP“ imply 
that a € }p,,, so the y; generate E as vector space over P. Since the number of y; is n = [E : P], 
these form a base for E/P. The foregoing argument shows also that ifa € S, thena € )' Fy; It 


follows that (j,..., y,) is a base for S/F. The existence of a set of elements that is 
simultaneously a base for E/P and for S/F implies that E = P ® pS. This proves the first 


assertion when [EF : F] < œ. Now let [E : F] be arbitrary. Let L be a finite dimensional subfield 
of E/F. Then S N L is the subfield of separable elements of L and P N L is the subfield of 
purely inseparable elements of L. We show next that L can be imbedded in a finite dimensional 
subfield L’ such that L' is separable over P N L'. Let L = F(ay,..., a,) and let f(x) be a 
separable polynomial with coefficients in P having a; as root. Let pj,..., p, be the coefficients 
of all of the f(x) and put L' = F(a),..., a,5 Pj,---, Ps). Then L' = (L' N P)(aj,..., a,.) and the a; 
are separable over L’ N P. Hence every element of L' is separable over L’ N P, which means 
that L satisfies the first condition. To complete the proof we need to show that any element of F 
is a linear combination of elements of S with coefficients in P and that elements of S that are 
linearly independent over F are linearly independent over P. Since any finite set of elements 
can be imbedded in a finite dimensional subfield L’ satisfying the conditions, both of these 
results are clear from the first part. This completes the proof of the first statement in (1). 
Conversely, assume E = P ®pS where P is purely inseparable and s is a separable 


algebraic extension of F. Let (x,) be a base for S/F. We claim that for any e = 0, OPa) is also a 
base for S/F. The argument is similar to one we used before: Let a E S and write a = >} a,x,, 
a, E F. Then d =y a x” e Y Fx", which is an algebra over F. Since a € Fla’ ] a € 
Fx, so any a is a linear combination of the x”. To show linear independence of the PX it 
is enough to take a finite subset, say, {x),..., x,}. Then the subfield E(x,,..., x,) is finite 
dimensional separable over F and has a base (x1,..., Xps Vj---» Ym). Then the elements x,P°,..., 
XPE, YPE.. Y El are generators for E(x),...,x,,) as vector space over F. Since their number is 
the dimensionality, they are linearly independent over F. Hence {x,?°,..., x,?°} is a linearly 
independent set. Since the x, forma base for S/F, every element of P ®,S can be written in one 
and only one way in the form ¥ b, ® x,, ba E€ P. There exists an such that every b,?° e F. 
Hence (X b, 8 x,)P° =} bF @ x,P° = 1 @=e Sand ifb, ®@ x, + 0, then some b, # 0. Then 
bP? # and ©) b Pex P # 0. Since Sis a field, this element has an inverse in S and hence £ b, 0 
x, 1s invertible. This proves that P 0 ; s is a field. Clearly P is a purely inseparable subfield, S 
is a separable algebraic subfield of E, and E = P ®,S = P(S). Thus E is generated over P by 


separable algebraic elements. Hence E is separable algebraic over P. Then, by the result we 
proved first, £ = P ®p S' where P’ is the subfield of purely inseparable elements and S” the 


subfield of separable ones. Then P' D> P, S' > S and since E = P NFS, it follows that P’ = P 
and S' = S. This completes the proof of (1). 

(2) If E is normal over Æ E is separable over its subfield of purely inseparable elements, 
by Proposition 8.14. Hence E = S N pP as in (1). Let a e S have minimum polynomial fx). 
Then fx) is separable and f(x) = [],’ (x — a;), a, = a, in E[x]. Since S is the set of separable 
elements of E, every a; € S. This implies that S is a splitting field over E of a set of separable 
polynomials, so S is Galois over E by Theorem 8.16. Conversely, assume that E = s MP 
where Sis algebraic and Galois over E and P is purely inseparable. We have shown in (1) that 


E is an algebraic field over EF. It is clear that every automorphism of S/F has a unique 
extension to an automorphism of E/P and this implies that E is Galois over P. Leta € E and let 
fix) € Fix], h(x) e P[x] be the minimum polynomials of a over F and P respectively. Then 
h(x)P* e F[x] for a suitable e, so f(x) = h(x)?°. Since E is Galois over P, h(x) is a product of 
a factors in E[x]. Then f(x) is a product of linear factors in E[x]. Thus £ is normal over F. 


These results apply in particular to the algebraic closure F of E. We have F = SF @, PF 
where SF is the subfield of E of separable elements and PF is the subfield of purely 
inseparable elements. SF is Galois over E and we have called this field the separable 
algebraic closure of E. We recall that a field is called perfect if every polynomial with 
coefficients in the field is separable (BAI, p. 226). For characteristic 0 this is always the case, 
and for characteristic p it happens if and only if every element of the field is a pth power. It is 
easily seen that PF is perfect and that this is contained in any perfect extension of F. For this 
reason PF is called the perfect closure of F. 


EXERCISES 


. Let E/F be algebraic and let K = F(x,,..., x,), the field of fractions of FLx),..., x], x; 
indeterminates. Show that E ®@;K = E(x),..., x,) and hence E ®pK is a field. 


— 


2. Show that E(x) ®pF(y) is not a field if x and y are transcendental. 


3. Show that if E/F contains a purely inseparable element not in F, then E® pE has a non- 
zero nilpotent element. 


4. (J. D. Reid.) Suppose that Fo is not perfect and that char Fy =p. Let a € Fo, € Fo (the 
subfield of pth power). Let E = F(x), x transcendental over Fy. Put y = PP + a) ~ ! 
and F = Fo). Show that E is not separable over F and that P(E/F), the subfield of E of 
purely inseparable elements over F, is F. 


5. Let E and F be as in exercise 4. Show that E ®p E contains non-zero nilpotent elements. 


6. If E is a finite dimensional extension field of F, where S = SE, the subfield of separable 
elements, put [E : F]; = [S : F] and [E : E]; = [E : S]. These are called the separability 


degree and inseparability degree respectively of E/F. Evidently [E : F] = [E : F]ş [E : 
E],. Show that if K is finite dimensional separable over F, then [K : F]; = [K : E], [E : 
F];. Show that this holds also if K is purely inseparable over F. Finally show that [K : 
F]; = [K : E], and [K : F]; = [K : E],£ : F]; holds for arbitrary finite dimensional 
extension fields of E/F. 


7. Show that an algebraic extension of a perfect field is perfect. 


8. Let P be purely inseparable over F. We say that P/F has exponent e > 0 if a?° e F for 
every a € P but there exist a such that a??~ ! ¢ F. Show that P/F has an exponent if [P 


:F] < oo and that if P/F has exponent e then 
0 l e 
P = P! Cap "Dee D Pl ae F, 


where P = F(PP ^. Show that PË- D/PO is purely inseparable of exponent one. 


9. Let P/F be finite dimensional purely inseparable of characteristic p. Show that P has a 
subfield Q/F such that P/Q = p. 


10. Let P = F(a) where char F = p and a is algebraic with minimum polynomial AP° — a over 
E. Determine all the subfields of P/F 


3.8 SEPARABLE SPLITTING FIELDS 


We shall now prove the existence of a separable splitting field for any finite dimensional 
central simple algebra A/F. This will enable us to upgrade the results on the Brauer group that 
we obtained in section 8.4 and make them apply to arbitrary finite dimensional central simple 
algebras. As a consequence we shall obtain the result that the Brauer group Br(F) for any field 
F is a torsion group. 

The theorem we want to prove is 


THEOREM 8.20. If A is finite dimensional central simple over F, then there exists a finite 
dimensional separable field S/F that is a splitting field for A. 


It suffices to prove this for A = A, a division algebra. In this case the result will follow 
from the Corollary to Theorem 4.7 if we can show that A contains a maximal subfield that is 
separable. To prove this we shall need a few remarks on derivations of algebras. 


We note that if A is any algebra and d € A, then the map Dy : x ~ [dx] = dx — xd is a 
derivation. It is clear that D, is linear and the calculation 


[dx]y+x[dy] = dxy—xdy+xdy—xyd 
= dxy—xyd = [d, xy] 


shows that D,(xy) = (Dax)y + x(Dyy). We call Dy the inner derivation determined by d. 
Evidently we have the relation D; = d; — dp where d; and dp are the left and right 
multiplications determined by d. Since d; and dg commute, we have D, = (d; — dp) = $4; = 0 
()(— 1) 7) d,'dp* — i. If the characteristic is p, this gives DP = (Œ); — (d’)p = Dyp. We can 
write this out as 


(41) Pm l 
[ala [d] TI) = [aP] 
We are now ready to prove 


THEOREM 8.21. Any finite dimensional central division algebra A/F contains a maximal 
subfield s that is separable over F. 


Proof. There is nothing to prove if A = F, so we assume A # F. We shall show first that A 
contains an element a ¢ F that is separable over F. Choose any a € A, ¢ F. If F(a) is not 
purely inseparable, then it contains a separable element not in F and we have what we want. 
Next suppose that F(a) is purely inseparable. Then the characteristic is p and the minimum 
polynomial of a over F has the form x?° — a. Put d = aP° - ! Then d ¢ F but dP e F. Then the 
inner derivation Dy + 0, but DP = Dgr = 0. Hence we can choose a € A so that [da] = b # 0, 


but [db] = 0. Now, put c = ab" 'd. Then 
Dac = (Dia)b™ 'd = 


Hence [dc] = de — cd = d and ded 7! = c + 1. We now see that the subfield F(c)/F has a non- 
trivial automorphism, namely, the restriction to F(c) of the inner automorphism x ~ dxd 7 !. It is 
readily seen that if E/F is purely inseparable, then the only automorphism of E/F is the identity. 
Hence F(c) is not purely inseparable. Then we have a separable element in F(c) not in F. Thus 
in any case we have ana ¢ F that is separable over F. 

We shall complete the proof by showing that any maximal separable subfield S of A is a 
maximal subfield of A. Otherwise, A' = C,(S) # S. By the double centralizer theorem 
(Theorem 4.10, p. 224), S is the center of A' so A' is a central division algebra over S with 
[AA' : S] # 1. By the result we proved first, we have a subfield S(a) ¢ S that is separable over 
S. This contradicts the maximality of S as separable subfield of A since S(a) is separable over 
F.D 


As we have seen, Theorem 8.21 implies Theorem 8.20. 


Now let s be a finite dimensional separable splitting field for A and let E be the normal 
closure of S. Then Æ/F is finite dimensional Galois. Since any extension field of a splitting 
field is a splitting field, E/F is a splitting field for A. Let A be the division algebra associated 


with A by the Wedderburn theorem and let d be its index. Then, by Theorem 8.12, Af ~ 1. 
Evidently this implies 


THEOREM 8.22. Br(F) for any field F is a torsion group. 


The order of {A} in Br(F) is called the exponent of the central simple algebra A. Since 
{At¢ = 1 for the index d of A, we see that the exponent e of A is a divisor of its index. 


EXERCISES 


1. Let p be a prime divisor of the index d of a central simple algebra A. Show that ple, the 
exponent of A (Hint: Let E be a Galois splitting field of A and let [E : E] = m. Then d\m 
so p\m. By the Galois theory and Sylow’s theorem, there is a subfield K of E/F such that 
[E : K] =p! where p’ is t E highest power of p dividing m. Let e’ be the exponent of A’ = 
Ax in Br(K). Show that e’le and that e'is a power of p different from 1.) 


2. Show that if A is a central division algebra of degree d = pj'...p,/,1; > 0, pı distinct 


primes, then A = A, ® ... ® 4, where the degree of A; is p/. (The degree is the square 
root of the dimensionality. Exercise 9, p. 226, is needed in the proof.) 


3.9 KUMMER EXTENSIONS 


We recall that a finite dimensional extension field E/F is called abelian if E is Galois over F 
and the Galois group G of E/F is abelian. The same terminology can also be used for infinite 
dimensional extension fields. However, we shall confine our attention to the finite dimensional 
case. In this section and in section 8.11 we shall study two types of abelian extensions that are 
particularly interesting because they can be treated in a fairly elementary, purely algebraic 
fashion. 

If G is a finite abelian group, the least common multiple e of the orders of the elements of G 
is called the exponent of G. Evidently e ||G|. In this section we shall be dealing with a base 
field F that for a given positive integer m contains m distinct mth roots of unity and we shall 
give a survey of the abelian extensions whose Galois groups have exponents m'|m. We call 
these extensions Kummer m-extensions of F. 

The hypothesis that F contains m distinct mth roots of 1, or equivalently, that x” — 1 = [],"(x 
— €) in F[x] with distinct č; implies that the characteristic of F is not a divisor of m. The set 
U(m) = {¢| 1 <i < m} is a subgroup of the multiplicative group F* of F and U(m) > U(m') for 
every m'm. We remark also that if G is a finite abelian group of exponent m'|m, then the 
character group G of G can be identified with hom (G, U(m)), the group of homomorphisms of 
G into U(m). For, by definition, G is the group of homomorphisms of G into the multiplicative 
group of complex numbers. Hence if y € ĉ and g e G, then y(g)” = y(g”) = (1) = 1 Hence 
x(g) is contained in the group of complex rath roots of 1 which can be identified with U(m). 
Accordingly, ĉ can be identified with hom(G, U(m)). We recall that |¢| = |G| (p. 281). This fact 
has some important consequences that we shall need. We state these as a 


LEMMA. Let G be a finite abelian group, 6 its character group. Then 
(1) For any s £ 1 in G there exists ap € G such that X(s) # 1. 


(2) For any s € G let §: denote the map p ~ p(s) Ginto ©* (or into U(m), m the 
exponent of G). Then §: € ĉ and s~ §:is an isomorphism of G onto ĉ.. 


(3) A set {%1X2--- Xp} Of characters generate 6 if and only if the only s € G such that 
Hs) = 1, LATS rissa, 


Proof. (1) Let H = {s|p(s) = 1 for all x € ê}. Then H is a subgroup and condition (1) will 
follow if we can show that H = 1. Let G = G/H. Any p € 6G defines a character x on G by x(gH) 
= x(g) and the map p ~ x is “ monomorphism of ĉ into G. Since |G] = |é| and |G| = |G|, this 
implies that |G| > |G|. Hence |G| = |G| and H = 1. 

(2) It is clear that 5:€ d and s ~ §:is a homomorphism. If §:= 1, then y(s) = 1 for all y € ĉ 
hence s = 1 by condition 1. Thus s ~ §: is a monomorphism. Since |G| = |ê] = |ê], it follows that 
s ~œ s is an isomorphism. 

(3) The result established in (2) has the consequence that we can regard G as the character 
group of ê. Hence assertion (3) is equivalent to the following statement: {5),..., s,} generate G 
if and only if the only y in ê such that y(S;) = 1, 1 <i <r, is y = 1. Suppose that the s; generate 
G. Then y(s;) = 1, 1 <i < r, implies y(s) = 1 for all s and hence y = 1. On the other hand, 
suppose that the subgroup H = < s),..., $, > F G. Then G = G/H +£ 1 and we have a character x 
+ 1 onG. This defines a character x on G by x(g) = x(gH) > which satisfies 7(s) =1, 1 < i < r, 
andy 4 1. O 


Now let E£ be an abelian extension of F whose Galois group G has exponent m'|m. Let E* 
and F* be the multiplicative groups of E and F respectively. Let M(E*) be the subset of E* of 
elements whose rath powers are contained in F* and let M(E*) be the set of rath powers of the 
elements of M(E*). Then M(E*) and M(E*) are subgroups of E* and F* respectively, M(E*) > 
F* and N.E*)>F*” = {ala e F*}. 

Let p e M(E*) and put 


wm te o 
(42) X(s)= (sp),  sEG. 
Since p” € F*, (sp)” =p” and x (s)” = 1, so x,(s) € U(m). Moreover, since U(m) c F*, 


zpst) = (stp)p' = s((tp)p')((sp)p~') 


1 


= (tp)p (spp * = x,(s)x(t). 


Hence x, : s~ x,(s) is a character of G. If p1, p2 E€ M(E*) then %p1P(s) = s0 1P21P) ~ l= (s 


pp ~'Cp Dpr ! 
now prove 


= Xp (S)Xp2(s). Hence p ~ x, is a homomorphism of M(E*) into ê. We shall 


THEOREM 8.23. Let F be afield containing m distinct mth roots of 1 and let E be an 
abelian extension of F whose Galois group G has exponent m'\m. Let M(E*) be the subgroup 
of E* of elements whose mth powers are in F*. 


Then we have the exact sequence 


À A 
(43) | > F* $ M(E*)5G-1 
where & denotes inclusion and d is p ~ x, where x,(s) = s(p)p ~ l s e G. The factor group 
M(E*)/F* = G. We have E = F(M(E*)) and E = F(p;,..., p,) for p; € M(E*) if and only if the 
cosets p,F* generate M(E*)/F*. 


Proof. To prove the first statement we have to show that å is surjective and its kernel is F*. 
Let x e G. Then y(st) = y(,)y(t) and hence there exists a p € E* such that y(s) = (sp)p~ l| (P. 
361). Since (sp)p 7! € U(m), sp” = p” for every s € G. Hence p” € F* and p e M(E*). Thus 
X = Xp for p € M(E*) and x is surjective. Now suppose y, = 1 Then sp =p, s € G, so p € F*. 
Hence ker 1 = F*. 

The exactness of (43) implies that M(E*)/F* = ĉ. Since ĉ = G, we have M(F*)/F* = G and 
the homomorphism 2 of M(E*) onto ĉ gives the isomorphism 4 : pF* ~ xp of M(E*)/F* onto 6. 


Accordingly, the characters x, ,..., x, generate 6. if and only if the cosets p;F* generate 


M(F*)/F*. Suppose this is the case and consider the field EZ’ = F(p),..., p,.). Let H = Gal E/E". 
Ift € H, then=p;, 1 <i <r, so x,(t) = 1. This implies that y(¢) = 1 for every y € ê. Thent = 1 
by statement (1) of the lemma. Thus H = 1 and hence E = F = F(p),..., p,). Evidently this 
implies that E = F(M(E*)). Conversely, let p,..., p, E€ M(E*) generate E/F and let s e G. Then 
Sp; =P 1 <i<r, imply sp =p for everyp € E and s = 1. Thus y,(s)=1, 1 <i <r, implys=1. 
Then by (3) of the lemma, the y, generate 6. and the cosets p;F* generate M(E*)/F*. 0O 


We consider next the map u : p ~ p™(F*") of M(E*) into ME*)/F*™. This is an 
epimorphism and its kernel is the set of p e M(E*) such that p” = a”, a e F*. Then pa` ! € 
U(m) c F*. It follows that ker u = F*. Hence we have the isomorphism # : pF* ~ p”(F*™) of 
M(E*)/F* onto N(E*)/F*”. Then, by Theorem 8.23, ME*)/F*™ = G, so this is a finite subgroup 
of F*/F*™, Thus we see that any Kummer extension whose Galois group has exponent m'm 
gives rise to a finite subgroup ME*)/F*” of F*/F*™. We remark also that if /),..., 6. are 
elements of M(E*) whose cosets £,F*” generate N(E*)/F*”, then E = F(\W Br,..., (Y P>) where 
(YB, denotes a root of x” = 2. 

We shall now show that any finite subgroup of F*/F*™ can be obtained in the manner 
indicated from a Kummer extension, that is, if the given group is N/F*”, then there exists a 
Kummer extension E/F such that N(E*) = N. We remark that since x” = 1 for any xe F*/F*™, the 
exponent of N/F*” is necessarily a divisor of m. We work in an algebraic closure F of F. If p 
e F, we choose a particular element p € F satisfying p” = p and write p = VB. We have 


THEOREM 8.24. Let N/F*” be a finite subgroup of F*/F*" and let B,,..., p, be elements of 


N whose cosets B.F'™ generate N/F*". Then E = FB Baila (Bn) is “ Kummer m-extension 
such that N(E*) = N. 


Proof. We note first that E is a splitting field of 
(44) F(x) = (x"— B1) (x"—B,). 


This is clear since x” — 6; = | jC — ¢B;) where U(m) = {č ¢,..., 6m}. Moreover, since x” — p; 
has distinct roots, the polynomial f(x) is separable. Then F is a splitting field over F of a 
separable polynomial and hence E/F is finite dimensional Galois. Let s, t € G = Gal E/F. We 


have sp; = Cs pp; and tp; = Či for p; = Vb. Then stp; = ¢s(i)ct(i) = tsy; Hence G is abelian. 


Since spi = č" = pi = pi, s” = 1, and the exponent of G is m'm. Then E is a Kummer m- 
extension of F. 

It remains to show that ME*) = N. Since p” = p; € F*, the p; e M(E*). Since E = F(p),..., 
p,), it follows from Theorem 8.23 that the cosets p;"F'*” generate M(E*)/F*. Hence the cosets 


p)"F*" = B.F*" generate NE*)/F*™. On the other hand, the cosets 6,F*” generate N/F* Hence 
N=ME*).0 


Our results establish a 1 — 1 correspondence between the set of finite subgroups of F*/F*™ 
and the set of Kummer m-extensions of F contained in F. It is clear that this correspondence is 
order-preserving where the order is given by inclusion. Since the set of finite subgroups of 
F*/F*™ is a lattice, we see that the set of Kummer m-extensions constitutes a lattice also and 
our correspondence is a lattice isomorphism. 


8.10 RINGS OF WITT VECTORS 


An extension field E of a field £ of characteristic p £ 0 is called an abelian p-extension of E if 
E is an abelian extension and [E : F] = p°. Alternatively, we can define an abelian p-extension 
as an extension that is finite dimensional Galois with Galois group G a p-primary abelian 
group. We have given a construction of such extensions with [E : F]= P in BAI, p. 300. These 
were first given by Artin and Schreier, who also constructed abelian p-extensions of p7- 
dimensions, in connection with the proof of an algebraic characterization of real closed fields 
(see p. 674). The Artin-Schreier procedure was extended by A. A. Albert to give an inductive 
construction of cyclic p-extensions of p° dimensions. Slightly later Witt gave a direct 
construction of all abelian p-extensions analogous to that of Kummer extensions. Witt’s method 
was based on an ingenious definition of a ring, the ring of Witt vectors, defined by any 
commutative ring of characteristic p. These rings have other important applications and they 
are of considerable interest beyond the application we shall give to abelian p-extensions. 


We start with the ring X= Q[x, yj, z4] in 3m indeterminates x, y;, 2, 0 <i, j, k <m- 1, over 
@). Consider the ring X™ of m-tuples (ap, Ays---» Ay — 1), a; E X, with the component-wise 
addition and multiplication. Let p be a fixed prime number. We use this to define a map 


(45) 4 = (ao, ay, . ey ~> pa = (a, ah... an 1) 
where 

j y 1-1 
(46) qa” =- a + pa; nae a p'a, 0 <v Sm- 7 


Thus a = ag, a) = ag? + pay,.... We introduce also the map P : a ~ Pa =(ag’,ay?,...5 Gy P). 
Then (46) gives 
(47) =a, = (Pa? +p'a, vel. 


Next let 4 = (a, a,..., a(”- ')) be arbitrary and define a map y by y A= {ap Gy5---5 Gn — 1) 
where 


ag = a” 
(48) 


; i =| jan 
a,=—(a” -aj —pat -+= p'a) vèl. 


Direct verification shows that yga = a and gwA = A. Hence ọ is bijective with y as inverse. 

We shall now use the maps g and y = g / to define a new ring structure on X It will be 
convenient to denote the usual (component-wise) addition and multiplication in X™ by @ and 
© respectively and write u = (1, 1,...,1), the unit in X”) Then we define a new addition and 
multiplication in X” by 


= 
a+b= (pa® gb) 
(49) 
—'] 
ab = o~ (pa ob). 
We have 9(0,..., 0) = (0,..., 0) and ọ(l, 0,..., 0) = (1, 1,..., 1) = u. It follows that( X°”) + ,n,0, 
1) where 1 = (1, 0,..., 0) is a ring and g is an isomorphism of (X” + ,n,0, 1) onto (X™ @, © 
,0, u) (Exercise 10, p. 97 of BAI). We denote the new ring as X,, and write (a, a®,..., a ~ 
D) for (glao, Gy5-++5 Am — 1), ete. 
We now examine the formulas for x + y, xy, and x — y for the “generic” vectors x = (Xo, X1, 
e Xm- 1) Y = Wo Yis- <- Vm — 1). For example, we have 


| P=! 5 anes 
(x+y)o = Xo + Yo, (x+y) =Xity)1 p 3 (Px yo 
1 


(xy)o = Xoyo,  (xXy)ı = XoPy1 +X1Yo” + X11. 


In general, if ° denotes one of the compositions +, -, or —, then it is clear from the definitions 
that (x ° y), is a polynomial with rational coefficients and 0 constant term in Xo, Yo, X1, Y1- -Xy 


y,. Also one sees easily that 


(50) kE =n tE e yi) 


where f, is a polynomial in the indicated indeterminates. The basic result we shall now 
establish is that (x ° y), is a polynomial in xo, yo,..., Xy, Y, With integer coefficients and 0 
constant term. 

Let Z[x; y;] = Z[xo Yos- --» Xm- 1» Ym- 1] and write (p“) for p“ Z[x; y;], u Z 0. Then we have 


LEMMA 1. Let uw>1,0<k<m—1,a=(a,), b= (b,) where a,b, € Eix; y], 0<v<m-1, 
o a=(a), ob =(b). Then the system of congruences 


(51) a, = b, (p, O<v<k 
is equivalent to 


(52) a” = b™ (p"*"), = O<v <k. 


Proof. We have a = ap, b® = bo, so the result is clear for k = 0. To prove the result by 
induction on k we may assume that (51) and (52) hold for v < k — 1 and show that under these 
conditions a, = b; (p") if and only if a) = b (p! + *), It is clear that a, = b, (p) if and only 
if p*a, = p*b, (p"**). Hence by (47), it suffices to show that (Pa)“~ D = (Pb)& =- D (pk + 4) 
holds under the induction hypothesis. We have a, = b, (p“), 0< v < k- 1. Since (i) = 0 (p), 1 < 
i <k- 1, this gives a? = b (p“*'),0<v<k-—1. Hence, the induction on k applied to the 
vectors Pa and Pb gives (Pa) ~ ) = (Pb) 9) (p#*!**~— H, which is what is required. 0 


We can now prove the basic 


THEOREM 8.25. If x ° y denotes x + y, xy, or x — y, then (x ° y), is a polynomial in xo, Vo, 
Xis Yis +++ Xp Vy With integer coefficients and 0 constant term. 


Proof. Since (x ° y), is a polynomial in xo, Vg, ..., X,, Y, With rational coefficients and 0 
constant term, it suffices to show that (x ° y),e Z[x; y;]. This is clear for (x ° y)ọ and we 


assume it for (x ° y),, 0 < k< v — 1. By (47), we have 


(53) p'(x oy), = (x oy) — (P(x oy) je- 1) 


and (x ° y)” =x + yM e zix, y;]. The induction hypothesis implies that (P(x ° yo DER 
[x; y;]. Hence by (53), it suffices to show that (x ° y)™ = (p(x 0 y)) P (p”). By (47), we have 
x) = (Px) D (p’) and y) = (Py) À (—p’). Hence 


(x oy)” = x) $y = (Px) N+(Py)" 1) 
= (Pxo Py)" (p'), 


We are assuming that (x ° y), € Z[x; yj], 0 <k <v- 1. For any polynomial with integer 
coefficients we have fxg, Yo... =f(xq’; yo’, =(p). It follows that (P(x ° y) Jy = ( (Px) ° (Py) 
X (p), 0<k<v- 1. Hence by Lemma 1, we have 


(55) (P(x 0 y))"7 P = (Px o Py)” ” (p’). 


By (54) and (55), (x ° y)™ = (P(x ° y))~ D(p”), which is what we needed to prove. D 


(54) 


It is convenient to write the result we have proved as 


(x + Y) ye Sy(Xo, Voses Xis We Z[xi, yj] 
(56) (xy), = my(Xo, Yos: -s Xv WE ZLX, yi] 
(x = y) a d\(Xo, VOs+++9Xyy Yy) E Z[ xi, yj]. 


Let y be an algebra endomorphism of X / O. Suppose that yx, = a,, yy, =b,,0<v<m-1. 


Then we have yx = a™, ny™ = bO), n(x + yy) = 9x™ + ny = aM) + bO, and n((x + y),) = 
(a + b),. Hence by (56), 


(57) (a+b), = S,(do, bo, ..., ay by). 
Similarly, 

(58) (ab), = m,(do, bo, ... yy Dy). 
(59) (a—b), = d,(do, bo, ..., Ay, by). 


Since there exists an endomorphism of X/@ mapping the x,, and y,, into arbitrary elements of X, 


the foregoing formulas hold for arbitrary vectors a =(do, a),..., am- 1), b = (bo, 5y,..., Bm- 1) € 
Xm 

We are now ready to define the ring W,,(A) of Witt vectors for an arbitrary commutative 
ring A of characteristic p. The set W,,(A) is the set of m-tuples (do, @),..., Am- 1), 4; E A, with 


the usual definition of equality. We define addition and multiplication in W,,,(A) by 


(a T b), = 5\(do, bo, vey yy b,) 
(ab), me m,(do, bo, -sly b,) 


where the right-hand sides are the images in A of S$,(Xo, Yos...» Xy Vy) and W,(Xo, Yos- -> Xys Yy) 
respectively under the homomorphism of Z[x;, y;] into A such that x; ~ a, y; ~b, 0<i<m-1. 
Also we put 0 = (0,..., 0), 1 = (1, 0,..., 0) in W,,(A). Then we have the structure (W,,(A), +, °,0, 
1). We shall now prove 


(60) 


THEOREM 8.26. (W,,(A), +, n, 0, 1) is a commutative ring. 


Proof. To prove any one of the defining identities for a commutative ring— i.e., the 
associative laws, the commutative laws, distributive laws— we let a, b, c be any three 
elements of W,,(A). We have the homomorphism y of Z [x;, y;, Zg] into A such that x, ~ a, y, ~ 


b,, Zy % Cy, 0<v<m-l. Let I,, denote the subset of X,, of vectors whose components are 
contained in 2[x;, y;, zz]. By (57)-(S9) this is a subring of X,,. Moreover, comparison of (57)— 
(59) with (60) shows that (uo, Wy,...5 Up — 1) ~ (Nuo, U4,---> Um — 1) is a homomorphism of 
(L,» +.) into (W,,(A), +, +). This homomorphism maps x = (Xo, X15- - -> Um- 1) Y = Wos Yis- <- Yn — 
i], Z = (Zo Z15--+» Zm — 1) into a, b, c respectively. Since (xy)z = x(yz) in I„ we have (ab)c = 
a(bc). Hence the associative law of multiplication holds in W,,(A). In a similar manner we can 


prove the other identities for multiplication and addition. The same type of argument shows 
that 0 is the 0-element for addition and 1 = (1, 0,..., 0) is the unit for multiplication. To prove 
the existence of negatives we apply our homomorphism to — x. Then the image is the negative 
of a. It follows that (W,,(A), +, n 0, 1) is a commutative ring. D 


We shall call W,,(A) the ring of Witt vectors of length m over A. It is clear that W,(A) can be 


identified with A since a ~(a) is an isomorphism. It is clear also that if 7 is a homomorphism 
of A into another commutative ring 4' of characteristic p, then (do, @y,..-, Ap, — 1)  (YAp, NQ),.--5 


Nam- 1) is a homomorphism of W,,,(A) into W,,(4'). In this way we obtain a functor W,, from the 
category of commutative rings of characteristic p into the category of commutative rings. In 
particular, if A is a subring of A, then W,,(A) is a subring of W,,(A'). 

We shall now consider some of the basic properties of the ring W,,(A). For this purpose we 
introduce three important maps P R, and V. We note first that since A is a commutative ring of 


characteristic p, we have a ring endomorphism r ~ 7P in A. This gives rise to the Frobenius 
endomorphism P of W,,(A) where Pa = (ag, a4°,..-5 Gm P) for a = (ap, Ays-++5 Gj 1). Next 
we define the restriction map R of W,,(A) into W,,,_ ,(A) by 


(61) R(do, Q1,.++5Qm—1) = (Ao, ais n=) 
and the shift map V of W,,,_ (A) into W,,(A) by 


(62) V (do, a15... , Am-2) = (0, do,.--54m-2)- 


It is immediate that R is a ring homomorphism and we shall see that V is a homomorphism of 
the additive groups. We have 


RV (do, 41, +5 am- 1) = (0, do,...;4m-2) = VR(do, G15... sm-1), 


so VR = RV as maps of W,,(A) into itself. It is clear also from the foregoing formula that ( VR)” 
= 0. Moreover, we have PV = VP as maps of W,, _ ,(A) into W,,(A) and RP = PR as maps of 
W,,(A) into Wp- (A). 

We prove next the following 


LEMMA 2. The following relations hold in Witt rings: 


rh 
pl=14+1-+1=RVI, 


) 
(64) V(a+b)= Va+ Vb, 

) (Va)b = V(aPRb), ae W, (A), be Wn+1(4), 
(66) pa = RV Pa. 


Proof. Consider the subrings J/,, _ 1 Z 


m Im+1 Of Xn- 1Xm Xm+ respectively whose vectors 


have components in 2[x;,y;]. We define the maps R and V for these rings in the same way as for 
Witt rings and we define P as before. Consider the unit 1 = (1, 0,...,0) of Z, We have (g = (1, 
1,..., 1) and hence (PJ) P < v < m- 1. On the other hand, RV / = (0, 1,... ,0), so the 
definition of ọ gives (RV1) = 0, (RVI) = p, 1 < v < m- 1. It follows that (RVI) = (pl) 
(p), 0, < v < m- 1. By Lemma 1, this implies that (RVI), = (pl), (p). Now we have a 
homomorphism of Z„ into W,,,(A) sending x = (Xo, X1,- --Xm- 1) ~ (ap, ajs +» am- 1) Y = Wor Vy e+ 
Ym- 1) bo, 6;..., Dm- 1). lf we apply this to RV 1 and to p1, we obtain formula (63) from the 
foregoing relations and the fact that A has characteristic p. 
Next we note that Vx = (0,x9,....Xm_ 1), P= (0,V0,--+s Ym- 1). Hence 


(Vx) = pxb + px? tee + pX,- 
(67) ans 
= px", lS<v<m. 
Since (x + y)™ =x + y™, (67) and (vx) = (Vx) = (Vox + y))© = 0 give (Vix + y)) = 
(Vx) + (Vx), 0 <v <m. If we apply the homomorphism of 2[.x, yj], into A such that x, ~ a,, 
Y, “bp, 0<v<m-— 1, to these relations we obtain (V(a + b))™ = (Va + (Vb), 0<v<m. 


This gives (64). 
To prove (65) we shall show that 
(68) ((Vx)y), = V(xRPy), (p),  0<v<m, 


for x = (Xo,---5Xm- 1) and y = Vo, Yp- Ym. Put (Vx)y = (Wo, Wy,---> Winds ViXRPY) = (to, ths 
t,,). Then we have to show that w, =1,(p), 0 < v <m. By Lemma 1, this is equivalent to w) = 
{” (p’*!). This holds for v = 0 since w® = 0 = 1). For v > 1, we have, by (67), that w = 
px’ yy) and t = px (PRy) 1). Since y”) = (Py) D + py,, this gives the congruences 


w! ⁄ px" = yf v) = px! v (Py)! v-1) 
= px” PRy)”- 1) = t) ip 1 ). 


Hence (68) holds. If we apply the homomorphism of 2[Xo,..., Xm- 1 Yos- --» Ym] to A such that x, 
~ Ay, Y, ~ b, to (68), we obtain the required relation (65). 


If we apply R to both sides of (65), we obtain (RVa)Rb = RV(aPRb). Putting a = 1 in this, 
we obtain (RV1)(Rb) = RVPRb. Since RV1 = p1 by (63) and since Rb can be taken to be any 
element of W,,(A), this gives (66). 0 


We can now derive the basic properties of W,,(A) that we shall need. We prove first 


THEOREM 8.27. The prime ring of W,,(A) is isomorphic to 2/(p"). It consists of the Witt 
vectors with components in the prime ring of A (= 2/(p)). 


Proof. By (63), pl = RV 1. Then by (66), p*1 = (RV)’1. Iterating this gives p*l = (RV)*/. 
Hence p”~ 11 =(0, 0,..., 1) #0 and p”! = 0. Hence the prime ring is isomorphic to 2/(p”). The 
prime ring of A can be identified with 2/(p) and the subset of Witt vectors with components in Z 
/(p) is a subring of cardinality p”. Hence it coincides with the prime ring of W,,(A). 0 


We prove next the important 


THEOREM 8.28. The map (do, a)..., Am- 1) ~ ag is a homomorphism of W,,(A) onto A 


whose kernel N is a nilpotent ideal 


Proof. We have seen that R is a homomorphism of W,,,(A) into W,,,_ (4). Iteration of this gives 
a homomorphism R”~ 1 : (d,...5 Gn 1) ~ (Ao) of W,,(A) into W,(A). Since we can identify (ay) 
with ay € A, we have the homomorphism (dp, @..., Gy, — 1) ~ ao, Which is clearly surjective. 
The kernel N of the homomorphism is the set of elements of the form (0, a)..., a,,_ 1), so N = 
RVW,,,(A). We shall show that N” = 0. If we apply R to (65) we obtain (RVa)Rb = RV(aPRb), a 
E€ W,,(A) b € Wn + (A). Since Rb can be taken to be any element c of W,,(A), this gives the 
relation (RVa)c = RV(aPc) for any a, c e W,(4). Then (RVa)(RVc) = RV(aPRVc) = 
RV(aRVPc) = RV((RVPc)a) = (RV)* (PcPa) € (RVW (A). Thus Mc (RV)?W,,(A) = (RYN. 
Now suppose that for some k > 2, Nt c MRV)*~ 2N c (RV) 'N. Then if d = RVa € Nand b 
e NÝ, we have b = (RV)*c, c e W,, (A), since Nt c (RV)! N = (RV} W,, (A). Hence db = 
(RVa)((RV)‘c) € MRV} 1N, so N+! c NRV} 'N. Moreover, if a,c e W,,(A) then 


(RVa\((RV¥c) = RV((RV Pa)((RV)*" 'c))e RV(N(R V)F-2N) c (RVÝN. 
Thus for any k we have M c (RV)! N = (RV)‘ W,,(A) Since (RV)” = 0, this gives N” = 0. D 
An immediate consequence of Theorem 8.28 is the 
COROLLARY. An element a = (do, a),..., Ay — 1) E W,,(A) is a unit in W,,(A) if and only if 
Ag is a unit in A. 


Proof. Ifa is a unit, then so is dp since we have a homomorphism of W,,(A) into A sending a 
into apy. Conversely, if ag is a unit, then a = a' +z where z € Nand A' + Nis a unit in W,,(A)/N. 
It follows that a is a unit. O 


EXERCISES 


1. Obtain the formulas for s,, and m, as in (56) for v = 0, 1, 2. 


2. Let W,(A), W,(A),... be the sequence of Witt rings defined by the commutative ring A of 
characteristic p. Ifn > m, we have the homomorphism R” ~~” of W,,(A) into W,,(A). Show 
that these rings and homomorphisms define an inverse limit W(A) =" W (4). Show that 


W(A) is isomorphic to the ring of Witt vectors of infinite rank defined to be the set of 
infinite sequences a = (dp, 4 a,...), a, E A, with the addition and multiplication given 


by (49), 0 = (0, 0,...), 1 = (1, 0, 0,...). Show that N = {(0, a; a,...)} is an ideal in W(A) 
contained in the Jacobson radical. 


3. Let A be a perfect field of characteristic p (BAI, p. 233). Show that p WA) = VWA) 
where V(dp, a; a>,...) = (0, ag, a, a>,...). Show that any element of W(A) has the form 


p*a where a is a unit and hence show that W(A) is a domain. 


3.11 ABELIAN p-EXTENSIONS 


We recall first the Artin-Schreier construction of cyclic extensions of dimension p of a field F 
of characteristic p: Let £ be an element of F that is not of Ihe form o? — a, a e F, and let p be 
an element of the algebraic closure F of F such that p? — p = p. Then F(p) is a cyclic extension 
of F of dimension p and every such extension is obtained in this way. The proof makes use of 
the additive analogue of Hilbert’s Satz 90, which can be regarded as a result on cohomology. 
We shall now give the extension of these results to abelian p-extensions and we consider first 
the extension of the cohomology theorem. 

We suppose first that E is a finite dimensional Galois extension field of a field F of 
characteristic p and we form the ring W,,(E) of Witt vectors of length m > 1 over E. We have 


the subring W,,(F) of E and we have an action of the Galois group G of E/F defined by 


(69) S(P0sP15+++sPm—1) = (SPo,SP15-++55Pm—1)- 
This is an action by automorphisms and the subring of elements fixed under every s e G = Gal 
EIF is W_(F). 


If p = (Po, Pis--- Pm— 1) E W,,(E), we define the trace T(p) = È}, e gsp. Evidently T(p) € 
WAL). Since (p + o)o =Po ay 00 for p = Postel ae D and o = (Go,-++ On — > Tp) 7 (TrAPo), 
...). We shall require the following 


LEMMA 1. There exists ap € W (E) such that T(p) is a unit in W (F). 


Proof. By the Dedekind independence property of distinct automorphisms of a field, there is a 


Po E€ E such that T(po) £ 0. Let p = (Po,...). Then T(P) = (Tepo)... -). Since Teppo) £ 9, it 
is a unit in F. Hence, by the Corollary to Theorem 8.28, 7(p) is a unit in W,,(F). 0 


We shall use this result to prove the following: if we regard the additive group (W,,(£) +) 


as G-module under the action we defined, then the cohomology group H'(G, W,,(E)) = 0. In 
explicit form the result is 


THEOREM 8.29. Lets ~ u, be a map of G into W,(E) such that uy, = sp, + Uy. Then there 
exists ao E€ W,,(E) such that u, = so, s € G. 


Proof. The proof is identical with that of the special case in which m = 1, which was treated 


in Theorem 4.33 of BAI (p. 298). Choose p € W,,(£) so that 7(p) is a unit in W,,(F) and put t = 
TOY ' Yee clttp) Then 


T—ST = To? F (Halstp)— sulstp) ) 


= T(p) (1 » v) 


t 


= Hs. 


Hence if o = —1, then u, = so — o. D 


We recall that the Frobenius map p = (pọ P1>---> Pm- 1) ~ Po = PP, PP»---> Pm- 1P) iS an 
endomorphism of the ring W,,(£). We now introduce the map # defined by 


(70) P(p) = Pp—p. 


It is clear that this is an endomorphism of the additive group of W,,(£) (but not of the ring 
W,(£)). The kernel of # is the set of vectors p such that p,? = p,,0 < v < m — 1. This is just the 
set of vectors whose components are in the prime field of E. We have seen (Theorem 8.27, p. 
503) that this set of vectors is the prime ring of W,,(£) and can be identified with 2/(p”), 
whose additive group is a cyclic group of order p”. If G is any finite abelian group of exponent 


pë < p™, then the argument on p. 494 shows that the character group ĉ can be identified with 
the group of homomorphisms of G into the cyclic additive group of the prime ring of W,,(£). 


We shall make this identification from now on. 
We now assume that the Galois group G of the extension E/F is abelian of order p’ and we 
choose m so that p” > p°, the exponent of G. We introduce the following subset of W,,(£): 


(71) SW,,(E) = 1p E W,,(E)| Pp € W,,(F)}. 


This is a subgroup of the additive group of W,,(£) containing W,,(F). Ifp € SW,,(E), we define 
a map xp of G by x,(s) = sp — p. Then P, £) =i yl, sr yor Shee sp tap, 


—pt+sp—p, = SP —P = X,(S) or xp(s) = 0. Then x, (8) is in the prime ring of W,,(E). Also we 
have y (st) = stp - p = sip — tp + tp - p = sp —p + tp — p = xp(s) + x,(2). Thus x, is ° 
homomorphism of G into the additive group of the prime ring of W,,(£) and so may be regarded 
as an element of the character group 6. Next let p, o e SW,(£). Then p + o € SW,(E) and y, + 
pS) = S(p + o) —(p + 0) = Sp -p + So- 0 = XCS) + x,(s). Thus p ~X, is a homomorphism of 
the additive group SW,,(£) into ê. If x,(s) = 0 for all s, then sp = p for all s and p = a € 


W,(E). Hence W,,(F) is the kernel of p ~ y, Finally, we note that this homomorphism is 
surjective. For, lety € @. Then y(s) is in the prime ring and y(st) = y(s) + y(t) = sy(t) + y(s). 
Then, by Theorem 8.29, there exists ap e W,,(E) such that y(s) = sp — p. Since py(*) = x(°y we 
have s(pp — P) = Pp — P, $ EG. Then ? p € W,,(F) and so pe SW,,(E) and xp = x. This proves 
the surjectivity of y ~ x,. We now have SW,,(£)/W,,(F) = ê = G. We have therefore proved the 


first two statements of the following theorem, which is a perfect analogue of Theorem 8.23 of 
the Kummer theory: 


THEOREM 8.30. Let F be a field of characteristic p + 0, E an abelian p-extension field of 
F whose Galois group G is of exponent p°, and let W,,(E) be the ring of Witt vectors over E 
of length m where m > E. Let SW (E) be the additive subgroup of W,,(E) of p such that ? p = 
Pp—P E€ W,(F). Then we have the exact sequence 


(72) 0> W,(F) © SWy(E) > G > 


where « denotes inclusion and č is p ~œ %, where x,(s) = Sp — p The factor group 
SW (E)/W,,(F) = G. The field E/F is generated by the components of the lectors p e SW, (E) 
and 


(1) (1) (2) (2) j”) (r) 
BSF ying te abe ve De ase Vy seeesPm=1 


if and only if cosets p + W (F), p© = (ppy.--5 PO n1), generate SW,,(E)/W,,(F). 

The proof of the last statement is exactly like that of the corresponding statement in 
Theorem 8.23. We leave it to the reader to check the details. 

Following the pattern of our treatment of the Kummer theory we introduce next the set 


(73) QWn(E) = {P(p)| p E€ SW (E). 


This is a subgroup of the additive group of W„(F) containing PWF), the subgroup of vectors 
Pa, ae W,(F). We have the homomorphism 


(74) p> Pp+Pw,lF) 


of SW,,(E) onto OW,,(E)/#W,,(/). An element p is in the kernel of this homomorphism if and 
only if ?p = #a,a e W,(F). This is equivalent to ?(p — a) = 0, which means p — a is in the 
prime ring. Thus the kernel of (74) is W,,(/) and we have the isomorphism 


(75) OW, (E)/PWr(F) = SWr(E)/Wo(F). 


Since the second of these groups is isomorphic to G, the first is also isomorphic to G. We shall 
show next that if Q is any subgroup of the additive group of the ring W,,(/’) containing #W,,(F) 


and O/#W,,(F) is finite, then O = O(W,,(£)) for an abelian p-extension E/F. For this we need 


LEMMA 2. Let P= (Bobis Bn — 1) E€ W,,(F). Then there exist p,,0<v<m-—1, in F such 
that E = F(Po, P5-+-> Pm- 1) is finite dimensional separable over F and the vector p = (Po, P, 
--- Pm — Of WE) satisfies Fp =P. 


Proof. If m = 1, we choose p in F so that pP — p = p. Then Fo) is separable, since the 
derivative (x?—x— /)' =— 1 and hence x? — xp— x — p has distinct roots. Now suppose we have 
Po.» Pm 2 in F so that E' = F(po, P1,---> Pm — 2) is separable over F and A(pp, P1s---> Pm- 2) = 
Pos P1--- Pm- 1) in Wp- (E. Consider the polynomial ring E'[x] and the Witt ring W,,(E"[x]). 
Lety = (Po, P1s---» Pm — 2X) in this ring and form 


Py = (Po, pih, Pn—2)X")— (P05 P15++ +s Pm—2s X)- 


We have #y = (Bo, Pi- bm- 2 JE), Axe E'Lx], and since (Bo, Pis- -< Bm — 2X) = OP, piP- 
Pm- 2; X.) it follows from (50) that x? — f(x) +x + y when y e E'. Thus f(x) =x? — x — y and if 
we choose p,,_ ; gF so that f(p,, 1) = 0, then the derivative argument shows that E'(p,, _ 7) is 
separable over E'. Hence E = Fo, Po, ---» Pm- 1) is separable over F. The formulas show that 


ifp = (PoP 1»---» Pm- 1 İS then # p = (Bo, 7...» Bm 1)-G 
We can now prove 


THEOREM 8.31. Let Q be a subgroup of (W,,(F), +) containing ®W,,(F) such that O/# 
W (F) is finite. Then there exists an abelian p-extension E of F such that the exponent of the 


Galois group of EIF is p°, e < m, and QW (E) = Q. 


Proof. Let p®, B..., B be elements of Q such that the cosets BO + # W.,(F) generate O/# 
W,(f). By Lemma 2, F contains a field Æ that is finite dimensional separable over F and is 
generated by elements p, ®® 1 <i<r,0<v<m-—1 such that ?(p®,..., Py 1®) = B®,- -> Pm 
_ ,) in W,,(E). Let E' be the normal closure of E in F so E' is a finite dimensional Galois 
extension of F containing E. We form W,,(£") and let the Galois group G of E'/F act on W,,(E’) 
as before. Ifs € Gand p = (pp, _ ;), then # p = (PÒ gives Asp = BO. Hence Asp 
— p) = 0, so sp — p® is in the prime ring of W,,(E’). This implies that sE c E, s e G. It 
follows that E is Galois over F and hence E' = E. If s, t € G, then sp = p® + yO, pO = pO + 
5 where yO + SY e W (F). Hence tsp = p® + yO + 6 = stp, which implies that G is 
abelian. Also s*p = p® + y, Since W,,(E) is of characteristic p”, this implies that s?” = 1 
hence G has order p? and exponent p° with e < m. Let yi be the character of G defined by p : 
xs) = sp — p®. Then it is clear that yi(s) = 1, 1 < i < r, implies that s = 1. It follows that the 
xi generate ĉ. Hence, if p is any element of W,,(E) such that Pp € W,,(F) then xp = [ x;". This 
implies that p = Y mp + $, B € W,,(F) m; integers. Then 2(p) = mb + (B) € Q. Since p 
is any element of SW (E) this shows that O(W,,(E)) c Q. The converse is clear so the proof is 
complete. O 


The results we have obtained are analogous to the main results on Kummer extensions. They 
establish a 1— 1 correspondence between the abelian p-extensions whose Galois groups have 


exponent p°, e < m, with the subgroups Q of (W,,(F) +) containing #W,,,(F) as subgroup of 
finite index. 


We shall now consider the special case of cyclic p-extensions. We observe first that the 
original Artin-Schreier theorem is an immediate consequence of the general theory: Any p- 
dimensional cyclic extension of F has the form F(p) where 4p = pP —- p =f e F and £ is an 


element such that there exists no a € F satisfying Pa = p. We shall now prove a result that if 
such an extension exists, then there exist cyclic extensions of F of any dimension p™, m > 1. 
For this purpose we require 


LEMMA 3. If 8 = (Bobi Bin — 1) E€ WF), then p”~ 1) e WF) pp”! Be @W,(F) if and 
only if fy € PF. 


Proof. We note first that by iteration of the formula (66) we obtain p”~ ! 8 = (0,..., 0, By?” ~ 


1), Next we write 


(0,...,0, Bo) — (0,...,0, 82" ) = [(0,...,0, Bo) — (0,...,0, 82)] 
+ [(0,...,0,B5) — 0,...,0,89)] + 
+ [(0,...,0,B3” -) — (0,...,0, B>” ')]. 


Evidently the right-hand side is contained in @W,,(F). Hence p”~ /B = (0,... ,0,By?”" ~ New 
W (F) if and only if (0,...0,By))¢ #W,,(F). Suppose that this is the case, say, (0,..., 0, 89) = pa — 
a for a = (Op, Ajs... &, JE W,,(F). Applying R to this relation gives PR, — Ra = 0 where Ra 
= (Oo,.-+5 OG» — 2). Hence if y = (9,04. . -0n — 7,0), then P= and if ô = a — y, then Pô — ô = (0, 
...,0,Yo). Moreover, since Rd = Ra — Rò = 0, ò = (0,..., 0, 5,, — 7). Then the formula (50) 
applied to Pô = ô = (0,..., 0, Ao) implies that #6,,_ | = ap, SO Po E PF. Conversely, if this 
condition holds, then #6 = (0,..., 0, 69) for ô = (0,..., 0, 5,,,_ 1) and hence p”~ Re PW (F). D 


We can now prove 


THEOREM 8.32. Let F be a field of characteristic p # 0. Then there exist cyclic 


extensions of p” dimensions, m > 1, over F if and only if there exist such extensions of P 
dimensions. The condition for this is PF £ F. 


Proof. We have seen that there exists a cyclic extension of p dimensions over F if and only if 
F 4 PF. Suppose that this conditions holds and choose fy € F, Po € F. Let B = (Po, P2.. Om- 


1) where the £,, i > 0, are any elements of F. By Lemma 3, p”~/ B ¢ @W,,(F). This implies that 
the subgroup Q of (W,,(F) +) generated by £ and PW (F) has the property that O/#W,,(F) is 


cyclic of order p”. By Theorem 8.31, Q = OW,,(E) for an abelian p-extension E/F. Moreover, 
we have seen that the Galois group G of E/F is isomorphic to O/#W,,(F). Hence this is cyclic 


of order p” and E/F is cyclic of p” dimensions. 0 


EXERCISE 


1. Show that if 8 € W,,(F) satisfies p”~ | 8 e #W,,(F), then there exists a y € W,,(F) such 
that py = B(#W,,(F)). Use this to prove that any cyclic extension of p” ~ ! dimensions 


over F can be embedded in a cyclic extension of p” dimensions over F. 


8.12 TRANSCENDENCY BASES 


A finite subset {a,..., a,}, n = 1, of an extension field E/F is called algebraically dependent 
over F if the homomorphism 


(76) Bivona) n Fiais 


of the polynomial algebra F[x,,..., x,,], x; indeterminates, into F has a nonzero kernel. In other 
words, there exists a non-zero polynomial F(x),...,x,,) such that F(a), ..., a,) = 0. Evidently if 
{a)...,a,}, 1 < m <n, is algebraically dependent, then so is {aj...,a,}. We shall now say that 


an arbitrary non-vacuous subset of E is algebraically dependent over F if some finite subset 
has this property. We have the following criterion. 


THEOREM 8.33. A non-vacuous subset s of an extension field E/F is algebraically dependent 
over F if and only if there exists an aéS that is algebraic over F(S — {a}). 


Proof. If T is a subset of E and a is algebraic over F(T), then a is algebraic over F(U) for 
some finite subset U of T. It follows that it suffices to prove the theorem for S finite, say, S = 
{a,... ap}. Suppose first that S is algebraically dependent. We shall prove by induction on n 


that there exists a; a; such that a; is algebraic over F(S;), S; = S— {a;}. This is clear from the 
definitions if n = 1, so we assume that n > 1. We may assume also that {q,..., a,_ 1} iS 
algebraically independent (= not algebraically dependent). Then we have a polynomial f(x,..., 
x,) #0 such that f(a,..., a,) = 0. Write fx),....x,,) =fo(x1,....x,,) £ 0 such that f(a),...,a,,) = 0. 
Write fly..--%_) “SE b- -Xn DEn” tA pe Xn Dh Fe H SpE. An 1) With for, 
Xn 1) £ 0. Then fo(aj,...,a, _ 1) £ 0. so g(x) = fo(ay,..., an — x," + filaj- G — 1) £O. 
Then f(a- an — 1) 80 SX) = fajo- Ap — 1) + f(a- Uy Sp” oe + Eplay 
1) is non-zero polynomial in F(a)..., a, — [x] such that g(a,) = 0. Hence a, is algebraic over 
F(aj,..., ap 1) 

Conversely, suppose one of the a; is algebraic over S; = s — {a;}. We may assume that 7 = n. 
Then we have elements 5y,..., b, € F(a)... d, _ 1) such that g(a,,) = 0 for g(x) =x” + bx” ! + 
... by E F(a ..., ap 1)[x]. There exist polynomials fo(x)...,%, -1 AE Xn Dee An 
5X, D E E [xi Xn] with fo(a,..., a, 1)# 0 such that b; = f(ay,..., a, — 1) Jolais- -> Ay — 
D- |. Then if we put 


ivona Ml FFAS icy e Ie TIM Mee hfe one ob falios Xai) 


we shall have F(x),...,x,) #0 and f(a,,..., a,) = 0. Thus S is algebraically dependent over F. 
o 


We shall now introduce a correspondence from the set E to the set P(E) of subsets of E, 
which will turn out to be a dependence relation on F in the sense defined on pp. 122—123. This 
is given in 


DEFINITION 8.1. An element a of E is called algebraically dependent over F on the subset 
S (which may be vacuous) if a is algebraic over F(s). In this case we write ax S. 


Using this definition, Theorem 8.33 states that a non-vacuous subset S is algebraically 
dependent over F if and only if there exists ana € S that is alge-braically dependent over F on 
S— {a}. We shall now show that the correspondence < between elements of £ and subsets of E 
satisfies the axioms for a dependence relation, that is, we have 


THEOREM 8.34 The correspondence < of E to Z(E) given in Definition 8.1 is a 
dependence relation. 


Proof. The axioms we have to verify are the following: (i) Ifa € S, then a < S. (ii) Ifa < S, 
then a < U for some finite subset U of S. (iii) Ifa < S and every b € S satisfies b < T, then a < 
T. (iv) Ifa < Sand a x S— {b} for some b in S, then b < (S— {b}) U {a}. Axiom (i) is clear 
and (11) was noted in the proof of Theorem 8.33. To prove (111) let A be the subfield of E of 
elements algebraic over F(T). Then S c A and if a < S, then a is algebraic over A. Hence a € 
A, which means that a < T. To prove (iv) let a < S, a x S— {b} where b e S. Put K = F(T) 
where T = s — {b}. Then a is transcendental over K and algebraic over K(b). Hence, by 
Theorem 8.33, {a, b} is algebraically dependent over K, so there exists a polynomial f(x, y) # 
0 in indeterminates x, y with coefficients in K such that f(a, b) = 0. We can write f(x, y) = 


Ag(x)y™ + a(x)y"~! +... + a, (x) with a(x) € K[x] and agx) + 0. Then agla) + 0, so fla, y) + 


0 in K[y], and f(a, b) = 0 shows that b is algebraic over K(a) = F(T O {a}). Hence b < TU 
{a} as required. O 


We can now apply the results that we derived on dependence relations to algebraic 
dependence. The concept of a base becomes that of a transcendency base, which we define in 


DEFINITION 8.2. Jf E is an extension field of F a subset B of E is called a transcendency 
base of E over F if(1) B is algebraically independent, and (2) every a € E is algebraically 
dependent on B. 


The two results we proved in the general case now give 


THEOREM 8.35. E/F has a transcendency base and any two such bases have the same 
cardinality. 


It should be remarked that B may be vacuous. This is the case if and only if E is algebraic 
over F. The cardinality |B| is called the transcendency degree (tr deg) of E/F. A field E is 
called purely transcendental over F if it has a transcendency base B such that E = F(B). 


EXERCISES 
1. Let E> K DF. Show that tr deg E/F = tr deg E/K + tr deg K/F. 


2. Show that if char F £ 3 and E = F(a, b) where a is transcendental over F and a> + b? = 1, 
then F is not purely transcendental over F. 


3. Let € be the field of complex numbers, Q the rationals. Show that tr deg € = |C|. Show 
that if B is a transcendency base of €/Q, then any bijective map of B onto itself can be 
extended to an automorphism of €/@. Hence conclude that there are as many 
automorphisms of €/Q as bijective maps of € onto €. 


4. Show that any subfield of a finitely generated E/F is finitely generated. 


5. Let E = F(xy..., Xm) where the x; are algebraically independent. Call a rational 
expression f = gh ~ ! homogeneous of degree m (eZ) if g is a homogeneous polynomial 
of degree r, h is a homogeneous polynomial of degree s, and r — s = m. Show that the set 
Eo of homogeneous rational expressions of degree 0 is a subfield of F that is purely 


transcendental of transcendency degree m — 1 over F. Show that E is a simple 
transcendental extension of Ep. 


8.13 TRANSCENDENCY BASES FOR DOMAINS. AFFINE ALGEBRAS 


Let D be a commutative domain that is an algebra over a field F E, the field of fractions of D, 
so F CD CE. Evidently, since F(D) is a subfield of E containing D, F(D) = E and hence D 
contains a transcendency base for E/F (see the comment (ii) on bases on p. 124). We call trdeg 
E/F the transcendency degree of D/F. This is an important concept for studying 
homomorphisms of domains that are algebras over the same field F. For, we have the 
following 


THEOREM 8.36. (i) Let D/F and D'/F be domains and suppose there exists a surjective 
homomorphism ņ of D/F onto D'/F. Then tr deg D/F > tr deg D'/F. (ii) Moreover, if tr deg 
D/F = tr deg DF = m < œ then n is an isomorphism. 


Proof. (i) Let B’ be a transcendency base for D'/F. For each x' € B' choose an x € D such that 
nx =x". Then C= {x} is an algebraically independent subset of D. Hence, C can be augmented 
to a base B for E/F, E the field of fractions of D (see comment (7) on p. 124). Hence 


trdeg D/F = |B| > |C| = |B'| = tr deg D'/F 


(ii) Now let B'= {x'),...,x',,$, C= {X1,..-,X%,} where yx; = x';. Since C is an algebraically 
independent set of cardinality tr deg E/F, B = C is a transcendency base for D/F. Let a be a 
non-zero element of D. Then a is algebraic over F(x),..., Xm). Let m(A) =A" — aA” 7 1 +... + 
An, Qi E F(x, ..., Xm) be the minimum polynomial of a over F(x|..., Xn). Since a + 0, a, # 0. 
We can write a = 9%},...5 Xm- - -Xm | Where {X},..-5 Xm), X1- Xm) © Fl.. Xml: 
Then we have 


JolX15+++5Xm)a" T ili ti + DAX is -s Xa) = 0 


and hence 


(77) GalL- -s Xa (Na) + °° + p(X) +++ Xm) = O. 


Since a, #0, g,,(x,.--,X,,) # 0 and since the x'; are algebraically independent g,,(x'),..., X'm) = 
0. Then by (77), na # 0. Thus a + 0 > na + 0 and q is an isomorphism. O 


We prove next the important 


NOETHER NORMALIZATION THEOREM. Let D be a domain which is finitely 
generated over a field F, say, D = F[u,...,u,,]. Let tr deg D =r < m. Then there exists a 


transcendency base {v;} such that D is integral over F[u,..., u,]. 


Proof. The result is trivial if m =r so suppose m > r. Then the u; are algebraically dependent. 
Hence there exists a non-zero polynomial 


Heat PED Mi SN wn 


in indeterminates x; with coefficients in F such that f(u)..., u,,) = 0. Let X be the set of 
monomials xy... x"; occurring in F (with non-zero coefficients). With each such monomial 
xf! ... x,Jm we associate the polynomial j,; +j>f+ ... +,” | € Z [t], t an indeterminate. The 
polynomials obtained in this way from the monomials in_X are distinct. Since a polynomial of 
degree n in one indeterminate with coefficients in a field has at most n zeros in the field, it 
follows that there exists an integer d > 0 such that the integers j, + jad +... + j,,d”~ | obtained 


m -— 1 


m 
from the monomials in X are distinct. Now consider the polynomial fx}, x)“ + y>,..., x14 
Vm) Where y>,..., Ym are indeterminates. We have 


m= iif, > A amet , Wm 
EEE kee "+ y,) = kes + yoy?...(X4 + Vel 
= To a eli ah (X15 Yasso Vn) 


7 > m-l 
xpi tdt- tim” "| Hence for a 


where the degree of g in x, is less than that of 2 ee 


E m—1 è x i z 3 s 
suitable Be F*, B fx, x4% +y....x)% + Yp) is monic as a polynomial in x, with coefficients 


j — g m-— 1 
di-1 2 <i <m we have ņnflup uf + wz... U? +wp) = 


in FLy,..-5 Yml. Lf we put w; = u;— u 
0 which implies that u; is integral over D' = F[w3,..., Wm]. By induction on the number of 
generators, D' has a transcendency base {v,} such that D' is integral over F[v,..., v,]. Then D 


is integral over F[v,..., v,] by the transitivity of integral dependence. O 


A commutative algebra that is finitely generated over a field is called an affine algebra. 
Such an algebra is Noetherian (Corollary to the Hilbert basis theorem, p. 421). We recall that 
the Krull dimension of a Noetherian ring is defined to be Sup s for chains of prime ideals Pp ? 


P? ... ? P, in R. We are now ina position to prove the following theorem on dimension of 
an affine domain. 


THEOREM 8.37. Let D be an affine domain of transcendency degree r over F. Then the 
Krull dimension dim D > r and dim D =r if F is algebraically closed. 


Proof. By Noether’s normalization theorem we may write D = F[uy,..., Up, Up + qo--+) Uml] 
where the u;. 1 <i < r, constitute a transcendency base and the remaining u; are integral over 
Fluy,..., u,]. Then F[uy,..., u,] is factorial and hence is integrally closed in its field of 
fractions. Under these circumstances we can apply the “going-down” Theorem 7.1 to show that 
dim D = dim F[u,...,u,]. First, let pp ? pı ? ... ? p, be a strictly descending chain of prime 
ideals in F[w,..., u,. By the lying-over Theorem 7.5, there exists a prime ideal Py in D such 
that P°o = Po N F[uy,...,u,.] = Po. By Theorem 7.6, there exists a prime ideal P} in D such that 
P; = F[u; and u,] ? P}. Then Py # P}. Then by induction we obtain a chain of prime ideals Po 
? Pi? ... ? P, such that P, O F[uy,...,u,] =p; 0 < i< s. This implies that dim D > dim 
Flu,..., u,]. Next let Py) ? P, #...% Pp for prime ideals P; in D. Then, by Corollary 2 to 
Proposition 7.17 (p. 410), po ? pı? ... ? Py, Pp; = P*; is a properly descending chain of prime 
ideals in F[u..., u,]. It follows that dim F[w,...,u,.] = dim D. Hence dim D = dim F[w),...,u,.]. 
Now we have the chain of prime ideals 


(Uis... st) P (Uest) Bo’ ? (u,) 2 (0) 


in F[u,..., u,.]. Hence dim D = dim F[uw..., u ,] > r = tr deg D/F. On the other hand, we have 
shown earlier (p. 453) that if F is algebraically closed then dim F[u,..., u,] for algebraically 
independent u; is r. This concludes the proof. O 


EXERCISE 


1. Use the Noether normalization theorem to prove the Corollary to Theorem 7.15 (p. 426). 


{Sketch of proof. Let m be a maximal ideal in F[x,..., x,], x; indeterminates, F 
algebraically closed. Then F[x),..., x,]/M is a field that is an affine algebra F[ X ,..., X 
nls X ; =x, + M. By the Noether normalization theorem and Proposition 7.17, tr deg F[x,, 
..+) X,|/M = 0. Since F is algebraically closed, F[x,..., x, ]/M = F. Then X ;=a;¢€ F, 1 
<i<n, and M=(x,—a,...,x, —d,)}. 


8.14 LUROTH'S THEOREM 


The purely transcendental extension fields E/F, especially those having a finite transcendency 
degree, appear to be the simplest type of extension fields. It is clear that such a field is 
isomorphic to the field of fractions F(x,,..., x,) of the polynomial ring F[x,...,x,] in 


indeterminates x},...,x,,. Even though these fields look quite innocent, as noted in BAI (p. 270), 
there are difficult and unsolved problems particularly on the nature of the subfields of F(x,, 
...,X,)/F. A problem of the type mentioned in BAI, which, as far as we know remains unsolved 


(although it was stated as an exercise in the first edition of the author’s Lectures in Abstract 
Algebra vol. Ill (1964), p. 160), is the following: Let the alternating group a, operate on F(x), 


...X,) by automorphisms of this field over F so that z x; =x,(), 1 <i < n, for z € A, and let Inv 
A, be the subfield of fixed points under this action. Is Inv A, purely transcendental over F? 

The one case where the situation is quite simple is that in which F has transcendency 
degree one. We consider this case. 


Let E = F(t), t transcendental, and let u € E, ¢ F. We can write u = f(t)g(t) | where f(2), 
g(t) € Fit] and (AÐ, g(t))= 1. If n is the larger of the degrees of F(t) and g(t) then we can 
write 


f(t) = agotat + ant" 


g(t) = bo+bit+ =: + brt", 
4; b; € F, and either a, or b, #0. We have f(t) — ug(t) = 0, so 


(78) (d,—Ub,)t" +(Gy— 1 —Uby—1)t"” 1 + °°: +(dp—ubo) = 0 


and a, — ub, #0 since either a, # or b, #0 and u ¢ F. Thus (78) shows that ¢ is algebraic over 
F(u) and [F(t):F(u)] < n. We shall now prove the following more precise result. 


THEOREM 8.38. Let E= F(t), t transcendental over F, and let u e F(t), € F. Write u = 
ADAH! = 1 where (f(t),g(t)) = 1, and let n = max (deg F(t), deg g(t)). Then u is 
transcendental over F, t is algebraic over F(u), and [F(t): F(u)] = n. Moreover, the 
minimum polynomial of t over F(u) is a multiple in F(u) of f (x, u) = fx) — ug(x). 


Proof. Put f(x, y) = f(x) — yge(x) € Fix, y], x, y indeterminates. This polynomial in x and y is of 
first degree in y and it has no factor h(x) of positive degree since (f(x),g(x))= 1. Hence it is 
irreducible in F[x, y]. Now ¢ is algebraic over F(u) so if u were algebraic over F, then t 
would be algebraic over F, contrary to the hypothesis. Hence u is transcendental over F. Then 
F |x, u] = F[x, y] under the isomorphism over F fixing x and mapping u into y and hence /(x,u) 
is irreducible in F[x, u]. It follows that F(x, u) is irreducible in F(u)[x] (BAI, p. 153). Since 
f(t, u) = F(t) — ug(t) = 0, it follows that F(x, u) is a multiple in F(u) of the minimum 
polynomial of t over F(u). Hence [F(t):F(u)] is the degree in x of F(x, u). This degree is n, so 
the proof is complete. D 


A first consequence of this theorem is that it enables us to determine the elements u that 
generate F(t). These have the form u = F(t)g(t) ! where f(t) and g(t) have degree 1 or 0, (A®, 
g(t)) = 1, and either ft) or g(t) € F. Then 


(19) at+b 
u = —— 
ct+d 


where a, b, c, d e F, either a + 0 or c £ 0, and at + b and ct + d have no common factor of 
positive degree. It is easily seen that this set of conditions is equivalent to the single condition 


(80) ad —be # 0. 


Now if F(u) = F(t), then we have a uniquely determined automorphism of F(t)/F such that t ~ 
u and every automorphism is obtained in this way. 
The condition (80) holds for the matrix 


81) dmt > 


c d 


if and only if A is invertible. Now consider the linear group GL,(F') of these matrices (BAI, p. 
375). Any matrix A € GL,(F) as in (81) determines a generator u = (at + b)/(ct + d) of F(t) 
and hence determines the automorphism 7(A) of F(t)/F such that 


at +b 
+d 


(82) f(t) ofl =f 


If 


€ GL,(F), then 

„~ (dat+b'c\t+(a’b+b’'d) 
83 (An A t = ———————_ 

(83) MAMA) (’at+d'c)t+(cb+d'd) 


= ņn(A'A)t. 


Since any automorphism of F(t)/F sends ¢ into a generator, 7 is surjective. Hence by (83), 7 is 
an anti-homomorphism of GL,(F) onto Gal F(t)/F. The kernel consists of the matrices A as in 


(81) such that ((at + b)(ct + dY ! = t or at+b=ct* + dt. This gives c= b = 0 and a = d. Hence 
the kernel is the set of scalar matrices a,, a # 0. The factor group GL,(F)/F*1 is called a 
projective linear group and is denoted as PGL,(F). Hence Gal F(t)/F is anti-isomorphic to 
PGL,(F) and since any group is anti-isomorphic to itself (under g ~ g` '), we also have Gal 
F(t)/F = PGL,(F). 

One can determine all of the subfields of E/F for E = F(t), t transcendental: These have the 
form F(u) for some u. This important result is called 


LUROTH’S THEOREM. /f FE = F(t), t transcendental over F, then any subfield k of E/F, 
K £F, has the form F(u), u transcendental over F. 


Proof. Letve K, F. Then we have seen that ¢ is algebraic over F(v). Hence ź is algebraic 
over K. Let f(x) =x" + k,x"~!+... +k, be the minimum polynomial of t over K, so the k; € K 
and n = [F(t):K]. Since ¢ is not algebraic over F, some k; ¢ F. We shall show that K = F(u), u 
=k,. We can write u = g(t)h(t) l where gft], A(t) € FIt, (e, AA) = 1, and m = max (deg h, 
deg 2) > 0. Then, as we showed in Theorem 8.38, [E : F(u)] =m. Since K > F(u) and [E : K] 


=n, we evidently have m > n and equality holds if and only if K = F(u). Now t is a root of the 
polynomial g(x) — uh(x) e K[x]. Hence we have a g(x) € K[x] such that 


(84) g(x)—uh(x) = q(x) f(x). 

The coefficient k; of f(x) is in F(Z) so there exists a non-zero polynomial c(t) of least degree 
such that co(t)k; = c(t) € Flt] for 1 <i<n. Then cp (f(x) = f(x, t) = co(t)x" + e (Ax! +... + 
c,(t) e F[x, t], and f(x, t) is primitive as a polynomial in x, that is, the c(t) are relatively 
prime. The x-degree of f(x, t) is n and since k; = g(t)h(t)” l with (g(4), h(t) = = 1, the t-degree 
of f(x, t) is > m. Now replace u in (84) by g(fA(t) ! and the coefficients of g(x) by their 
expressions in t. Then (84) shows that f(x, t) divides g(x)h(t) — g(f)h(x) in F(2)[x]. Since f(x, t) 
and g(x)h(t) — g(t)h(x) € Fx, t] and f(x, t) is primitive as a polynomial in x, it follows that 
there exists a polynomial g(x, t) e F[x, t] such that 


(85) g(x)h(t)—g(t)h(x) = f(x, t)q(x, t). 


Since the ¢-degree of the left-hand side is < m and that of F(x, t) is > m, it follows that this 
degree is m and q(x, t) = g(x) € Fix]. Then the right-hand side is primitive as a polynomial in 
x and so is the left-hand side. By symmetry the left-hand side is primitive as a polynomial in t 
also. Hence g(x) = q € F. Then F(x, t) has the same x-degree and ¢-degree so m = n, which 
implies that K = F(u). D 


We shall now indicate some of the results that are presently known on subfields of purely 
transcendental extensions of transcendency degree greater than one. We use the algebraic 
geometric terminology in which a purely transcendental extension E/F is called a rational 
extension and a subfield of such an E/F is called unirational. In BAI (p. 270) we have noted 
some results and given some references on unirational fields of the form Inv G where G is a 
finite group of automorphisms of a field F(x,..., x,,)/F where the x; are indeterminates that are 


permuted by G. Further results on the rationality and non-rationality of fields of the form Inv G 
are given in a survey article by D. J. Saltman, “Groups acting on fields: Noether’s problem” in 
Contemporary Mathematics vol. 43, 1985, pp. 267—277. 

An old result on subfields of rational extensions of transcendency degree two is the theorem 
of Castelnuovo-Zariski: if F is algebraically closed of characteristic 0 then any subfield L of a 
rational extension F(x,,x>) such that F(x,,x>) is affine over L is rational. The result does not 
always hold for characteristic p # 0. (See R. Hartshorne’s Algebraic Geometry, Springer- 
Verlag, New York, 1977, p. 422.) 

Examples of non-rational subfields of rational extensions of transcendency degree 3 over € 
are given in the following papers: 

1. M. Artin and D. Mumford, “Some elementary examples of unirational varieties that 
are not rational,” Proc. London Math. Soc. vol. 25, 3rd ser. (1972), pp. 75-95. 

2. C. H. Clemens and P. A. Griffiths, “The intermediate jacobian of the cubic threefold,” 
Annals of Math. (2) vol. 95 (1972), pp. 281-356. 

3. V. A. Iskovkikh and J. Manin, “Three dimensional quartics and counterexamples to the 
Luroth problem,” Math. Sbornik, vol. 86 (1971), pp. 140-166. 

A. Beauville, J. L. Colliot-Thelene, J. J. Sansuc, and Sir P. Swinnerton-Dyer in “Varietes 
stablement rationelles non rationelle,” Annals of Math. (2) vol. 121, pp. 283-318 have given 
an example of an extension K/C of transcendency degree three such that a purely transcendental 
extension of transcendency degree three over K is purely transcendental of transcendency 
degree six over € but K is not rational over €. 


EXERCISES 


1. Let F} be a field of q elements and let K be the subfield of fixed elements of F,(¢), t 
transcendental, under Gal F;,(t)/F,. Determine an element u such that K = F(u). 


2. Let E = F[t, v] where t is transcendental over F and v? + £ = 1. Show that E is purely 
transcendental over F. 


The following two exercises sketch proofs due to Mowaffag Hajja (to appear in Algebras, 
Groups, and Geometries) that Im A, and Inv A, are rational. This improves results of Burnside 


published in Messenger of Mathematics, vol. 37 (1908), p. 165. We mention also that it has 
been proved recently by Takashi Maeda (to appear in the J. of Algebra) that Inv 4; for the base 


field Q is rational. The situation for Im A, with n > 5 is still unsettled. 


3. Show that Inv.A3 in K = F(x, x2, x3) is rational. 
(Sketch of Proof: We distinguish three cases: 


i. F contains a primitive cube root of 1, which implies char F £ 3. 
ii. char F £ 3 but F contains no primitive cube root of 1. 

iii. char F = 3. 

In all cases 4, is the group of automorphisms of K/F' generated by the automorphism o 
such that x, ~ x2, X2 œ X3, X3 ~ x4. Also Gal K/Inv A3 = A; and [K:Inv A3] = |A3| = 3. In 
case i we put x; =x, + wx + w¥/x3 where w° = 1, w £ 1. Then ox; = w JX, and K = F(X}, 
X», X). Now put Y; = Xy°/X>, h = X/x, Ý = xX3. Then oY, = Y, so F(Y}, Yo, Y3) c Inv 
A3. On the other hand, X? = Y,?Y, and K = F(X, X5, X3) = F(Y}, Y», Y3,X)). Hence [K: 
F(Y,, P, Y3)] < 3. Then Inv 43 = F(Y, Y5, Y3). In case ii we adjoin a primitive cube root 
of unity w to K to obtain k = K(w) = F(x}, x2, x3) for F = F(w). We have w? +w + 1 =0, 
(w, wœ) is a base for k'/ K and for F'/F and we have an automorphism t of k'/K such that 
w ~ w°. As in case i, we define X, = x) + Wx, + w/x3, j = 1, 2, 3, Y) = X/x, % = 
X?,/x,, Y3 =X% =x + x + x3. Then Inv A; = F'(Y;, Y, Y2). We have tY; = Y, I'Y, = Y; 
TY; =Y;€K Hence if Y, = wZ, + w°Z, where Z,eK then Y, = w*Z, + wZ, and Y; = Y, 
implies oZ; = Z; and Inv a3 = F(Z;,Z,Z3). In case iii we let 4 = 0 — 1 and U, = A’x, j = 
0, 1, 2. Then K = F(Up, U;, U2) and Uy = Up + U; cU; = U; + U, oU, = U). Let U = 
UU, + U? — U,U, Then K = F(U, U,, U>), cU = U, oU; = U, + U, and aU, = U}. 
Hence (o((U*, - U,;U,U’) and Inv a3 > F(U, Us, 1° - U)U>7). Since [K:F(U, U>, Up? - 
U,U,7).< 3 it follows that Inv a} = F(U, U>, U? - ULU). (Cf. exercise 1, p. 271 of 
BAL)) 

4. Show that Inv A, inK = F(x, X2, x3, X4) is rational. 


(Sketch of Proof: It is clear that A, is generated by the automorphisms a, p, o of K/F 
such that AX, = Xz, UXy = Xj, Ky = Xy, AX, = Xy; Px, = x5, Bx, =x, 
Bx, = Xa, xq = Xp, OX, = Xz, OX; = Xy, 0X3 = Xy, OX = x, If char F Æ 2 we define 


S= X; +X, + X, + X4 
Zi = X; + X, — X3 — X4 
2, =X = Xy FX = X 
Z3 =X, — X, — X3 + X4 


Then K = F(s, Z1, Z2, z3). The action of a, p, o on (S, 21, Z2, Z3) is given by the following 
table 


Put Y, = Z,23/Z, Yo = oY, = 223/Z), Y3 = y1 = Z,z>/z3. Then F(s, Yi, Y>, Y2) c Inv H 
where H = ¢a.8). Since =} = YiYs, z} = YaYs, F(s, Ya Ya Y2n2)K, and | H |= 4, it follows as before 
that Inv H = F(s, Yj, Y5, Y3). Since os; = s, and oY, = V, oY, = Y3, IY = Y}, the result of 
exercise 3 shows that Inv <*> in F(s, Yi, Y5, Y3) is rational over F(s). It follows that Inv 
A, in K is rational over F. Now suppose char F = 2. Put x =x), y = x1 + x3, Z = x1 T X2, S 
=X, + X3 + x3 + X4, X = xs + yz = xx4 + xx3. Then K = F(x,y,z,s) = F(X, y, z, s) and aX = 
X, ay =y +s, az = z, as = s, PX = X, By = y, fiz =z + s, ps = s. It follows that Inv H in 
K = F(X, s, y @ + s), z(z + 5)). Then Inv H = FX s, y0 + s), z(z + s)). If we put 


Xy = X = X,X4 + XgXy, Xim OX = XX4 + XX, Xam 7X = XyXa + xika, then 
WY + S) = Oxy + xalx + Xa) = Xp t Xy al + 5) = OG Xal +a) = Xa tX Hence. Inv H= F(X, X>, X3, S). 
Then the rationality of Inv A, follows from exercise 3 since as = s, aX, = X, aX, = X, 
aX, =X) 


8.15 SEPARABILITY FOR ARBITRARY EXTENSION FIELDS 


In this section we shall introduce a concept of separability for arbitrary extension fields that 
generalizes this notion for algebraic extensions. This is based on the concept of linear 
disjointness, which we now define. 


DEFINITION 8.3. Let E be an extension field of F A, and B subalgebras of E/F. Then A 
and B are said to be linearly disjoint over F if the canonical homomorphism of A @,B into E 


sending a ® b into ab, a € A, b € B, is a monomorphism. 


It is clear that if A and B satisfy this condition and A’ and B' are subalgebras of A and B 


respectively, then A' and B” satisfy the condition. Let K and L be the subfields of E/F generated 
by A and B respectively. Then A and B are linearly disjoint over F if and only if K and L are 
linearly disjoint over F. To prove this it suffices to show that if k}... K, are F-linearly 
independent elements of K and /,...,/,, are F-linearly independent elements of L, then the 
elements kil;, 1 <i<m,1l<j<n,are linearly independent over F. This follows from the linear 
disjointness over F of A and B by writing k,=a;=a4;a !, apa € A,1<is<m,l,=b,b '=b,,b 
€ B, 1 <j <n. Conversely, the linear disjointness of K and L over F implies that of A and B. 
The following result permits establishment of linear disjointness in stages. 


LEMMA 1. Let E, and E, be subfields of E/E a subfield of E,/F. Then E, and E, are 
linearly disjoint over F if and only if the following two conditions hold: (1) K, and E, are 
linearly disjoint over F and (2) K (E2) and £ are linearly disjoint over Kj. 


E, E, (Ez) 


K, (Ey) 


F E, 


Proof. Assume the two conditions. Let (u,) be a base for E,/F. By (1) these elements are 
linearly independent over and, since they are contained in K; (E>), they are linearly independent 
over £, by (2). Hence E} and Æ, are linearly disjoint over F. Conversely assume that E} and £, 
are linearly disjoint over F. Then (1) is clear since K; C £}. Let (u,) be a base for E/K; (vg) a 
base for K;/F, (w,) a base for E,/F. Then (u,v,) is a base for E,/F and since E; and E, are 
linearly disjoint over F, the set of elements {ua VBWy.} is linearly independent. This implies 
that the only relations of the form }'du, = 0 with d; in the subalgebra K,E, generated by K, and 
E, are the trivial ones in which every d; = 0. Since K/E) is the set of elements cd !, c, d € 
K,E,, it follows that (u,) is a set of elements that is linearly independent over K,(£>). Then (2) 
holds. OD 


We now assume that the characteristic is p # 0 and we imbed Æ in its algebraic closure Ë. If 
e > 0, we denote the subset of E. of elements a such that a° e F by F? °. This is a subfield and 
FcCF-!cF’ °c... Hence P~ = U, >; FP? is a subfield. Since linear disjointness is a 
property of finite subsets, it is clear that F” © and E are linearly disjoint over F if and only if 


FP © and E are linearly disjoint over F for every e. A result on separable algebraic extensions 
that we proved before can now be reformulated as 


LEMMA 2. Jf E/F is separable algebraic, then E and FP ~” are linearly disjoint over F. 


Proof. It suffices to show that if a,..., a,, are F-independent elements of Æ, then these are 
linearly independent over FP ~ © for every e. This is equivalent to the following: a,?°,..., a,?° 
are F-independent, which is a property of separable algebraic extensions proved on p. 489. 
Hence F and F” are linearly disjoint over F. D 


We prove next 


LEMMA 3. Jf E is purely transcendental over F then E/F and FP ~” are linearly disjoint 
over F. 


Proof. It suffices to prove the result for F = F(x,..., x,) where the x; are algebraically 
independent. Moreover, the result will follow in this case if we can show that F[x,...,x,,] and 
FP -° are linearly disjoint over F for every e > 0. We have a base for F[x),..., x,,|/F consisting 
of all of the monomials x*j1...x%,, k; > 0. The map m ~ mP“ for the set of monomials is 
injective onto a subset. Hence if (m,) is the base of monomials, then the set {m,?°} is linearly 
independent over F. It follows that {m,} is linearly independent over F? ~ ° and hence F[x; ..., 
x] and FP ~€ are linearly disjoint over F. OD 


An extension E/F is said to be separably generated over F if E has a transcendency base B 
such that E is separable algebraic over F(B). In this case B is called a separating 
transcendency base for E/F. The example of E inseparable algebraic over F shows that E/F 
may not be separably generated. The example of B = {xP } in F(x), x transcendental, shows that 
even if E is separably generated over F, not every transcendency base has the property that F 
is separable algebraic over F(B). 


We can now prove our main result. 


THEOREM 8.39. Let E be an extension field of a field of characteristic p > 0. Then the 
following properties of E/F are equivalent: 


(1) Every finitely generated subfield of E/F is separably generated. 
(2) E and FP ~” are linearly disjoint over F. 
(3) E and F? ~! are linearly disjoint over F. 


Proof. (1)= (2). To prove this we suppose that E is separable algebraic over F(B), B a 
transcendency base of E/F. By Lemma 3, FP and F(B) are linearly disjoint over F. By 
Lemma 2, E and F(B)’* are linearly disjoint over F(B). Since F(B)’-” > FP” (B) it follows 
that FP-°(B) and E are linearly disjoint over F (B). Then, by Lemma 1, E and F? ~' are linearly 
disjoint over F. Since this is a property of finite subsets, the result proved shows that (1)>(2). 


(2) > (3) obviously. 
(3) = (1). Assume that E and F? ~' are linearly disjoint over F and let K = F(ay...,a,,) be a 


finitely generated subfield of E/F. We prove by induction on n that we can extract from the 
given set of generators a transcendency base a,,...,a;, (where r is 0 if all the a; are algebraic 


over F) such that K is separable algebraic over F(ai,,..., a;). The result is clear if n = 0, so 
we assume n > 0. The result is clear also if a,..., a, are algebraically independent. Hence we 
assume that a),...,a,, 0 < r < n, is a transcendency base for K/F. Then a, ..., a, + 1 are 
algebraically dependent over F, so we can choose a polynomial F(x),...,.x, + 1) #0 € Fix, 

X, + 1] of least degree such that f(a, ... a, . , = 0. Then f(x),...,x, + 1) is irreducible. We 


claim that f does not have the form g(x,’,...,.%” +1), g © FIX.. -X,+ a For g(xP,... X, 4 P) = 
A(x)... X44)? in FPT Testes Xr ¡Jand if Ax]... Xp 1) = g(x)... Xr- i P), ean d, d= = 
0. Let HoN -X -ip 1 <i n u, be the monomials occurring in A. Then the elements ee o; 


_;) are linearly dependent over F? ~ 7 so by our hypothesis, these are linearly dependent over 
F. This gives a non-trivial polynomial relation in a)... a, , with coefficients in F of lower 
degree than f contrary to the choice of f. We have therefore shown that for some i, 1 <i<r+1, 
AXi,- -X,+ 1) is not a polynomial in x? (and the other x’s). Then a; is algebraic over F(q),..., 
Xa;,...,d, + 1) where >; denotes omission of a,. It follows that {a),..., xaj,...,a, + 1} is a 
transcendency base for F(a),...,a,,). Then F[qy,..., Aj 1s X, Gj4 pee @ + 1) = Fix.. X41] in 
the obvious way and hence f(a), ...,d;_ 1X, Gj+1.---, 4 +1) is irreducible in Fla, ..., a; _ 1,X, Aj 
+1- G, + 1]. Then this polynomial is irreducible in F(a, ...,a;...,4, + 1) [x]. Since a; is a root 
of F(a),..-,;_1,X,4i+15---4;4,-) and this is not a polynomial in xP, we see that a; is separable 
algebraic over F(a, ...4;...,4,4,;) and hence over L = F(a, ..., Gj...,a,). The induction 
hypothesis applies to L and gives us a subset {a;,..., a;} of {a)...,a,... ,@,] that is a separating 
transcendency base for L over F. Since a; is separable algebraic over L, it follows that a; is 
separable algebraic over F(qj,...,a;.). Hence {a}, ..., {aq} is a separating transcendency 
base for F(a)...,a,). D 


We remark that the result is applicable in particular to an algebraic extension E/F. In this 
case it states that if Æ and F?~ are linearly disjoint over F, then E is separable and if E is 


separable, then £ and FP” are linearly disjoint over F (which was Lemma 2). This makes it 
natural to extend the concept of separability for arbitrary extension fields in the following 
manner. 


DEFINITION 8.4. An extension field E/F is called separable if either the characteristic is 
0 or the characteristic is P + 0, and the equivalent conditions of Theorem 8.37 hold. 


The implication (3) = (1) of Theorem 8.37 is due to MacLane. It implies an earlier result 
due to F. K. Schmidt, which we state as a 


COROLLARY. /f F is perfect, then every extension E/F is separable. 


Proof. This is clear since F is perfect if and only if the characteristic is 0 or it is p and FP! = 
F.D 


The following grab-bag theorem states some properties and non-properties of separable 
extensions. 


THEOREM 8.40. Let E be an extension field of F, k an intermediate field. Then (1) If E is 
separable over F then k is separable over F. (2) If E is separable over k and k is separable 
over F, then E is separable over F. (3) If E is separable over F, then E need not be 
separable over K. (4) If E is separable over F it need not have a separating transcendency 
base over F. 


Proof. We may assume that the characteristic is p # 0. (1) This is clear since the linear 
disjointness of E and F”? over F implies that of K and FP ~/ over F. (2) The hypothesis is that 
E and KP ~! are linearly disjoint over K and that K and F?~ ! are linearly disjoint over F. Then 
E and K(F®~ ') are linearly disjoint over K since K(F?~ !) c KP- ! Hence, by Lemma 1, E and 
FP" are linearly disjoint over F and so E is separable over F. (3) Take E = f(x), x 
transcendental, and K = F(x?). (4) Take E = F(x, x? ~ |, xP ~ ?,...) where x is transcendental 
oe F. Then E has transcendency degree one over F and EF is not separably generated over F. 


EXERCISES 


1. Let E,/F and E,/F be subfields of E/F such that Ei/F is algebraic and E,/F is purely 
transcendental. Show that E and E, are linearly disjoint over F. 


2. Let F have characteristic p # 0. Let E = F(a,b,c,d) where a,b,c are algebraically 
independent over F and œ = ab? + c. Show that E is not separably generated over F(a, 
c). 

3. (MacLane.) Let F be a perfect field of characteristic p, E an imperfect extension field of 
transcendency degree one over F. Show that E/F is separably generated. 


8.16 DERIVATIONS 


The concept of a derivation is an important one in the theory of fields and in other parts of 
algebra. We have already encountered this in several places (first in BAI, p. 434). We consider 
this notion now first in complete generality and then in the special case of derivations of 
commutative algebras and fields. In the next section we shall consider some applications of 
derivations to fields of characteristic p. 


DEFINITION 8.5. Let B be an algebra over a commutative ring K, A a subalgebra. A 
derivation of A into B is a K-homomorphism of A into B such that 


(86) D(ab) = aD(b)+ D(a)b 
for a, b e A. If A=B, then we speak of a derivation in A (over K). 


Let Derg(4, B) denote the set of derivations of A into B. Then Der,(A, B) c hom,(A, B). If 
D,, D€ Der,(A,B), then the derivation condition (86) for the D; gives 
(Dı + D2)(ab) = a(D, + D2)(b) + (Dı + D2) (a)b. 
Hence D; + D, € Der(A, B). Now let k e K, De Der(A, B). Then 
(kD) (ab) = k(D(ab)) = k(D(a)b + aD(b)) = (kD) (a)b + a(kD(b)). 


Hence kD e Der,(A, B) and so Der;(A, B) is a K-submodule of hom,(A, B). 

Now let B = A and write Der,A for Derg(4, A). As we shall now show, this has a 
considerably richer structure than that of a K-module (cf. BAI, pp. 434435). Let D,, D, € 
Der, a, b e A. Then 


D,D>(ab) = D,(aD2(b)+D>(a)b) 
= aD, Dı(b)+ D,(a)D2(b) + D2(a)D,(b) + Dı D2(a)b. 


If we interchange D, and D, in this relation and subtract we obtain 


(87) [D:D] (ab) = a[D:D:] (b)+ [D:D] (a)b 


where we have put [D,D,] for D,;D, — D,D,. This result and the fact that Der,4 is a K-module 
of End,A amount to the statement that DergA is a Lie algebra of K-endomorphisms of A (BAI, 
p. 434). 

There is still more that can be said in the special case in which K is a field of characteristic 
p #0. We note first that for any K, if D e Der,A, we have the Leibniz formula for D”: 


(88) D"(ab)= 9, (i)D'(a)D"-'(b), 
i=0 


which can be proved by induction on n. If K is a field of characteristic p, then (88) for n = p 
becomes 


(89) D?(ab) = D”(a)b + aD"(b). 


This shows that D? € Der,A. If V is a vector space over a field K of characteristic p, then a 
subspace of End,V that is closed under the bracket composition [D,D,] and under pth powers 


is called a p-Lie algebra (or restricted Lie algebra) of linear transformations in V. Thus we 
have shown that if A is an algebra over a field of characteristic p, then DergA is a p-Lie 


algebra of linear transformations in A over K. 


There is an important connection between derivations and homomorphisms. One obtains 
this by introducing the algebra à of dual numbers over K. This has the base (1,5) over K with 1 


the unit and 6 an element such that 6” = 0. If B is any algebra over K, then we can form the 
algebra B ® , A and we have the map b — b ® 1 of B into B @, å. Since 4 is K-free, this is an 


algebra isomorphism and so B can be identified with its image B ® 1. We can also identify ô € 
4 with 1 ® ô in B k A. When this is done, then B ®ẹx 4 appears as the set of elements 


(90) bi +h, bie B. 


This representation of an element is unique and one has the obvious K-module compositions: 
Moreover, if b; e B, then 


(91) (bı +b2d)(by +b20) = bib; +(b1b2 + b2b1)ð. 


Now let D be a K-homomorphism of A into B. We define a corresponding map a(D) of A 
into B ® x å by 


(92) a(D):a > a+ D(a)ð, 


which is evidently a K-homomorphism. We claim that a(D) is an algebra homomorphism if and 
only if D is a derivation. First, we have D(1) = 0 for any derivation since D(1) = D(17) = 
2D(1). Now 


(x(D)(a) )(a(D)(b)) = (a+ D(a)d)(b+ D(b)d) 
= ab +(aD(b)+ D(a)b)d 
and 
«(D)(ab) = ab + D(ab)é. 
Thus a(D)(ab)) = (a(D)(a))(a(D)(b)) if and only if D is a derivation and a(D) is an algebra 


homomorphism if and only if D is a derivation. 
The homomorphisms a(D) have a simple characterization in terms of the map 


(93) t:b; +b 6 > by, bie B, 


of B ®,à into B, which is a surjective K-algebra homomorphism of B ® gà onto B. Ifa € A, 
then a(D)a = a + D(a) so ma(D)a = a. Hence ma(D) = 14. Conversely, let H be a 


homomorphism of A into B® gA. For any a we write H(a) = a, + aô. This defines the maps a 
* A), a ~œ, ay Of A into B, which are K-homomorphisms. The condition zH = 1, is equivalent to 
a, =a for all a. Hence if we denote a ~ a, by D, then H(a) = a + D(a)ô. The condition H(ab) = 
H(a)H(b) is equivalent to: D is a derivation. 
We summarize our results in 


PROPOSITION 8.15. Let A be a subalgebra of an algebra B and let D be a derivation of 
A into B. Then a(D): a ~ a + D(a)o is an algebra homomorphism of A into B gà such that 


oa(D) = 14. Conversely, any homomorphism H of A into B®,4 such that oH = 1, has the 
form a (D), D a derivation of A into B. 


The importance of this connection between derivations and homomorphisms is that it 
enables us to carry over results on algebra homomorphisms to derivations. In this way we can 
avoid tedious calculations that would be involved in direct proofs of the results for 
derivations. As an illustration we prove 


PROPOSITION 8.16. Let A be a subalgebra of an algebra B, D, D,, D, derivations of A 
into B, x a set of generators for A. Call an element a € Aa D-constant if D, =0. Then 


(1) Di = Dy if X= DX. 
(2) The set of D-constants is a subalgebra of A. Morever, if A is a division algebra, 
then it is a division subalgebra. 


Proof. (1) The condition D,|X = D,|X implies that a(D,)|X = a(D,)|\X for the algebra 
homomorphisms of A into B®,a. Since X generates A, we have a(D,) = a(D>). Hence D, = D}. 


(2) The condition that a is a D-constant is equivalent to: a is a fixed element under the 
homomorphism a(D) of A c B @x4 into B®xA. Since the set of fixed points of a homomorphism 


of a subalgebra A of an algebra C is a subalgebra and is a division subalgebra if A is a division 
algebra, the result on derivations is clear. 0 


We obtain next a formula for D(a ') for an invertible element a of A and derivation D of A 
into B. Since 1 is a D = constant, applying D to aa~ ! = 1 gives 


D(a)a~'+aD(a~') = 0. 
Hence we have the formula 
(94) D(a~') = -a~ tD(a)a t, 


which generalizes the well-known formula from calculus. 
From now on we consider derivations of commutative algebras into commutative algebras, 
that is, we assume B commutative. Let D € Derg(4, B) b € B. If we multiply the relation D(xy) 


= D(x)y + xD(y), x, y € A, by b we obtain 
bD(xy) = bD(x)y + xbD(y). 


This shows that bD defined by (bD)(x) = b(D(x)) is again a derivation. It is clear that this 
action of B on Der;(A, B) endows Der;(A, B) with the structure of a B-module. 


We now consider the problem of extending a given derivation D of a subalgebra A into B to 
a derivation of a larger subalgebra A'. Let a(D) be the corresponding homomorphism of A into 
B®, such that ca(D) = 14. The problem of extending D to a derivation of A' amounts to that of 


extending H = a(D) to a homomorphism H’ of A' into B®,A such that tH’ = 14. Now if H’ is a 
homomorphism of A’ into B®,A extending H, then zH' = 14. will hold if and only if zH'(x) = x 
holds for every x in a set of generators for A over A. We shall use these observations in treating 
the extension problem. 

We now suppose that A' = A[u)..., u,], the subalgebra generated by A and a finite subset 
{Uj,...U,} Of B. Let A[x),...,x,] be the polynomial algebra over A in the indeterminates x; and 
let Z be the kernel of the homomorphism of A[x,,..., x,,] onto A’, which is the identity on A and 
sends x; ~ u; 1 < i < n. Suppose that we have a homomorphism s of A into a commutative 
algebra C and elements v; 1 <i <n, of C. Then we have the homomorphism 


(95) [Pty 0y hy) HUG 04, ot) 


of A[x,..., x,] into C (BAI, p. 124). Here sf denotes the polynomial obtained from f by 
applying s to its coefficients. The homomorphism (95) induces a homomorphism 


(96) ‘ PRR NO e i I ~ (sf )(V1,..+5Un) 


of A[x),..., x, ]/Z into C if and only if (sf) (4,..., Vn) = 0 for every f € I. Since we have the 
isomorphism /(x),...,x,) + Z ~ f(uy,...,u,), we see that we have a homomorphism of A' into C 
extending s and sending u; ~ v,, 1 <i < n, if and only if 


(97) (Pont) = 0 


for every f € L. Moreover, it is clear that it suffices to have this relation for every F in any set 
of generators X for the ideal 7. 

If F(x),...,x,) € A [x},....x,], we write 0f/dx; for the polynomial obtained from f by formal 
partial differentiation with respect to x;. For example 


A 


( ? 2 9 
— (x?x2X3 + 3x1x3) = 3x1X2X3 + 3x3. 
OX, 


If D is a derivation of A into B, then we shall write (Df)(x,...,x,,) for the polynomial in B[x,, 


x, | obtained by applying the derivation D to the coefficients of f 


We can now prove 


THEOREM 8.41. Let B be a commutative algebra, A a subalgebra, £ = A[uy..., u,], uj € 
B, X a set of generators for the kernel of the homomorphism of A[x,,... ,x,,] onto A’ such that a 
~aforae Aandx;~u,, 1 <i<n. Let D be a derivation of A into B. Then D can be extended 
to a derivation of A' into B such that u; ~v; 1 <i <n, if and only if 


(98) (Df)(uy,...,Un)+ ae f ahaa, U,)v; = 0 
i=] OX; 


for every fe X. 


Proof. The condition that D has an extension of the sort specified is that a(D) is extendable to 
a homomorphism of 4' into B®, sending u; ~ u; + vj, 1 <i< n. This will be the case if and 


only if for every fe X 
(99) (a(D) f) (uy +040, U2 +020,...,Un+0,0) = 0 
Now let a € A and consider the monomial avi...» k: 20, We have 

(a+ D(a)d) (uy +015): +++ (un +00)" 

- > | A 
= aut tuk +((Da)uy*! u+ kiau," bug ++ uy vy 
+ kyauy* u ag* Upava oe + kraty" Up- yet” DADO. 

Hence for any fe A[x, ..., x, ] we have 


a(D) f) (uy +046, uz +020, ...,Un +00) = f(Uy,...5 Un) 
+((Df)(u1,..., Un) 
~ (ts .« , Un )Vi)Ò. 


Then the condition that (99) holds for all fe X is equivalent to (98). D 


We suppose next that S is a submonoid of the multiplicative monoid of A and we consider 
the localizations A, and B,. We can prove 


THEOREM 8.42. Let D be a derivation of A into B (commutative) and let s be a 


submonoid of the multiplicative monoid of A. Then there exists a unique derivation D, of a, 
into B, such that 


A A 5 
(100) D D; 
B B; 


is commutative. Here the horizontal maps are the canonical homomorphisms a ~ a/\ and b 
» b/l respectively. 


Proof. We have the homomorphism a(D) of A into B ® gå sending a ~ a + D(a)ô and the 
homomorphism of B®, into Bka sending b/1@u. Hence we have the homomorphism of A 
into B®gå sending such that a ~ a/1 +(D(a)/Da. Now if s e S, then s/1 + (D(s)/L)o is 
invertible with inverse l/s — (D(s)/s*)d. Hence by the universal property of A, we have a 
unique homomorphism H: A, ~ B,®,4 such that 


A A, 
(101) a(D) H 
B@, A B; @, A 


is commutative. If z, denotes the canonical homomorphism of B;®,A onto B, then the 
commutativity of (101) implies that z,H = 1,,. Hence H has the form a(D,) where D, is a 
derivation of A, into B,. Then D, satisfies the condition of the theorem. 0 


We now specialize to the case of fields. We consider an extension field E/F of a field and 
regard this as an algebra over F. Suppose K is a subfield of E/F and we have a derivation D of 
KIF into E/F and a is an element of E. If a is transcendental over K, then Theorem 8.41 shows 
that for any beE there exists a derivation of K[a] into EF extending D and mapping a into b. 
Then Theorem 8.42 shows that this has a unique extension to a derivation of K(a) into E: 
Hence if a is transcendental over K, then there exists an extension of D to a derivation of K(a) 


sending a into b. By Proposition 8.16 (and Theorem 8.42) this is unique. Next let a be 
algebraic over K with minimum polynomial f(x) over K. Then K(a) = K[a] and Theorem 8.41 
shows that D can be extended to a derivation of K(a) into E sending a ~ b if and only if 


(102) (Df )(a)+ f'(a)b = 0. 


If a is separable, (Ax), f(x)) = 1 and f’ (a) # 0. Then there is only one choice we can make for 
fo, namely, 


(103) b= -f'(a) ‘(Df)(a). 


Hence in this case D can be extended in one and only one way to a derivation of K(a) into E. If 
a is inseparable, then F(a) = 0. Then (101) shows that D can be extended to a derivation of 
K(a) if and only if the coefficients of f(x) are D constants and if this is the case, the extension 
can be made to send a into any b € E. We summarize these results in 


PROPOSITION 8.17. Let E be an extension field of F, k an intermediate field, D a 
derivation of K/F into F/F, a an element of E. Then 


(1) D can be extended to a derivation of K (a) into E sending a into any b € E if a is 
transcendental over k. 


(2) D has a unique extension to a derivation of K(a) into E if a is separable algebraic 
over K. 


(3) D can be extended to a derivation of K(a) into E if a is inseparable algebraic over 
k if and only if the coefficients of the minimum polynomial of a over k are D- 
constants. Moreover, if this condition is satisfied, then there exists an extension 
sending a into any be E. 


We now suppose that E is finitely generated over F: E = F(a)...,a,). Let X be a set of 
generators for the ideal J in F/x,..., x,,], x; indeterminates, consisting of the polynomials f such 
that f(a),...,a,,) = 0. Then it follows from Theorems 8.41 and 8.42 that there exists a derivation 
D of E/F into itself such that Da; = b; 1 <i <n, if and only if 


(104) È = au... a)b; = 0 


for every f e X. We recall that Der-F becomes a vector space over E if we define bD for b € 
E, D € DerpE by (bD)(x) = b(D(x)) (p. 525). We wish to calculate the dimensionality [Der;F : 
E] when E = F(a)..., a„). For this purpose we introduce the n-dimensional vector space E™ of 
n-tuples of elements of E. If D € DevpE, D determines the element (Day,..., Da,) of E™ and 
we have the map 


(105) Aqg,,...,0,:D (Day, ..., Day) 


of Der,E into E”), Evidently this is a linear map of vector spaces over E and since a 
derivation is determined by its action on a set of generators, 1 = /,,...,a, is injective. Hence 


[DerpE:E] is the dimensionality of the subspace A,Der,£) of E™. Now let g e FTx}..., X,]- 
Then g defines a map dg of E™ into E by 


n 


Ô 
(106) dg(b1s---sbu) >F = (dts. Qabi 


1 OX; 


Evidently this is linear. The result we proved before now states that (b; ...,b„) € 4 (Der;£) if 
and only if 


(107) df (bis... bn) = 0 


for every f e X. This is a system of linear equations that characterizes A(Der;£). This leads to 
a formula for [DerpE: E], which we give in 


THEOREM 8.43. Let E = F(a, ... ,a,) and let x be a set of generators of the ideal of 


polynomials in F{x,...X,] such that flay,...,a,) = 0. Let dX denote the space of E™®* spanned 
by the linear functions df, f € X. Then 


(108) [Der;E: E] =n— [dX : E]. 


Proof. We have [DerpE:E] = [ADerpE:E], and A DerpE is the subspace of E™ of elements 
such that df(b,,..., b„) = 0 for all fe X. Hence (108) follows from linear algebra. D 


By the Hilbert basis theorem, we can take X = {f)..., fm}. Then it follows from linear 
algebra that [dX: E] is the rank of the Jacobian matrix 


ô ô 

Fay is Fn 
(109) 

Ofm Ofm 


5 (Aty-++5Gn) °° Z (ais... 0n) 


Ox; OXy 


Combining this with Theorem 8.43 we obtain the 


COROLLARY. Let E= F(a,...,.a,) and let X= {f,,...,f,,} be a finite set of generators for 
the ideal of polynomials in F[x1,...,X,] such that fla,..., a,) =0. Then 


(110) [DerpE: E] = n — rank J(f;,..., fa) 
where J(f,,.--,f,) is the Jacobian matrix (108). 
We obtain next a connection between [Der,;E:F] and the structure of E/F. We prove first 


PROPOSITION 8.18. Jf E = E(a),...,a,), then Der-E = 0 if and only if E is separable 
algebraic over F. 


Proof. Ifa € E is separable algebraic over F, then Proposition 8.17.2 applied to the derivation 
0 on F shows that D(a) = 0 for every derivation of E/F. Hence DerpE = 0 if E is separable 


algebraic over F. Now suppose that E is not separable algebraic over F. We may assume that 
{@},..-,4,} (r = 0) is a transcendency base for E/F. Let S be the subfield of elements of E that 


are separable algebraic over F(q...,a,). If S = E, then r > 0 and we have a derivation of 
F(a,...,a,) into E sending a; 1 < i < r, into any element we please in E. By applying 
Proposition 8.17.2 successively to a, + 1,...,d,, we obtain extensions of the derivation of 
F(a,..., a,) to E to a derivation of E/F. Hence we can obtain a non-zero derivation of E/F. 
Now let E # S. Then the characteristic is p 0 and E is purely inseparable over S. We have 0 £ 
E [E: S] < œ and we can choose a maximal subfield k of E containing S (K # E). Ifa € E, ¢ 
EX, then K(a) = E by the maximality of K. Moreover, the minimum polynomial of a over k has 
the form x?° — b since E is purely inseparable over S and hence over x. If e > 1, then K(a??~ 1) 
is a proper subfield of E properly containing x. Hence E = K(a) where x?— b is the minimum 
polynomial of a over K. By Proposition 8.17.3 we have a non-zero derivation of E/K. Since 
this is a derivation of E/F, we have Der,;E#0 in the case E # s also. O 


We can now prove the following theorem relating [Der-E:F] and the structure of E/F. 


THEOREM 8.44. Let E = F(a,...,a,). Then [Der;E: E] is the smallest s such that there 
exists a subset {aj..., djs} Of {ajl, ...,a,} such that E is separable algebraic over F(aj,... 
,ai,). Moreover, [Der-E;E] is the transcendency degree of E over F if and only if E is 
separable over F. 


Proof. We again consider the map 2 =, ,..., 4; of DerpE into E defined by (105). Let s = 
[DerpE: E] = [A(Der;£): E] and let D}, D>, ..., D, be a base for DerpE over E. Then s < n and 
A(Der;E) has the base (Djq),..., Dja,),..., (Dsa,-.., Dsa,) and so the s x n matrix (Dja;) has 
rank s. Hence we may suppose that the a; are ordered so that 


(111) det(Djaj))#0 if 1 <i, j Ss. 


Put k = F(a;..., aç) and let D e DergE c DerpE. Then D = }%1b;D;, b; € E, and D(a;) = 0 for 1 
<j < s. By (110), this implies that every b; = 0, so D = 0. Hence DergE = 0 and so by 
Proposition 8.18, E is separable algebraic over x. Conversely, let {ai,,..., ai,} be a subset of 
{a},...,a,} such that E is separable algebraic over F(ai;... ,a;). By reordering the a’s we may 
assume that the subset is {a,...,a,. We now map DerpE into E by D (Dai,,,-t ,Da,). The 
kernel of this linear map is the set of D such that D(a) = 0, 1 < k < t, and hence it is the set of 
D such that D(K) = 0 for x = F(a,),..., a). Now D(K) = 0 means that D € Der,(£) and since E 
is separable algebraic over X, this implies that D = 0. Thus the map D ~ (Daj,...,Da,) is 
injective. Hence t > s = [Der,;E:E]. This completes the proof of the first statement. To prove 
the second, we note first that if [Der-E: E] = s, then we may assume that Æ is separable 
algebraic over F(a,...,a,). Then {a,...,a,} contains a transcendency base for E/F, sos < r = tr 
deg E/E. Moreover, if r = s, then (a,...,s,} is a transcendency base. Hence this is a separating 


transcendency base and E is separable over E. Conversely, suppose that E is separable over £E. 
Then the proof of Theorem 8.39 shows that we can choose a separating transcendency base 
among the a; so we may assume this is {a,...,a,}. Then E is separable algebraic over F(a, 


...,a,) and hence, as we showed in the first part, r > [DerpE:E]. Since we had [DerpE: E] >r, 
this proves that [Der,£: E] =r. D 


EXERCISES 


1. Let E = F(q)...,a,,). Show that E is separable algebraic over F if and only if there exist n 
polynomials /,,...f,, E F[x),....x,,] such that f,(a),...,a,) = 0 and 


det (vt 40. 
ON; 


2. Let D be a derivation in E/F, K the subfield of D-constants. Show that a,,..., a,, E€ E are 
linearly dependent over K if and only if the Wronskian determinant 


ay d T Cm 


Da, Da, s.. Da. 


p" lay p"- la, eee p" : lm 


3. (C. Faith.) Let E = F(aj,...,a,,) and let K be a subfield of E/F. Show that [Der-K: K] < 


4. 


[Der -E:F]. 


Let A be a subalgebra of an algebra B over a commutative ring K. Define a higher 
derivation of rank m of A into B to be a sequence of K-homomorphisms 


(112) D = (Do = Lir Diyas kha) 


of A into B such that 
(113) Diab) = ) D,(a)D,- jb) 
j=0 


for a, b e A. Let A0” be the algebra K[x]/(x”* !) so a™ has a base (1, ô,..., 0”) where ô 
=x+(x"*!) and 6"*!=0. Note that B®,a is the set of elements 


(114) bo+bið+ = +b,” 


where b, = b; ® 1, and 6 = 1 ® ô and that an element (113) is 0 if and only if every b; = 0. 
Let z denote the homomorphism of B ® g 4” into B sending by + b)5 + ... + 5,6" ~ bo. 


Show that a sequence of maps D = (Do, D),..., Dm) of A into B is a higher derivation of 
rank m of A into B if and only if 


4(D): a> Do(a)+D,(a)d+ + + Dp (a)o” 


is a K-algebra homomorphism of A into B @, a” such that za(D) = 14. 


5. Let A and B be as in exercise 4. Define a higher derivation of infinite rank of A into B to 


3.17 


be an infinite sequence of homomorphisms D = (Dy = 1, D4, D2,...) of A into B such that 


(113) holds. Obtain a connection between higher derivations of A into B and 
homomorphisms of A into B[[x]], the algebra of formal power series in x with 
coefficients in B. 


GALOIS THEORY FOR PURELY INSEPARABLE EXTENSIONS 
OF EXPONENT ONE 


Let E/F be of characteristic p + 0 and let D be a derivation of E/F. Ifa € E, then D(a’) = pa? ~ 
l D(a) = 0. Hence every element of F(E”) is a constant relative to every derivation of E/F. If c 
Ee F(FP) and a € E, then D(ca) = cD(a) for D e Der,£. It is natural to replace F by F(E”) in 
studying the derivations of E/F. We shall now do this, so with a change of notation, we may 
assume that EP c F, which means that either E = F or E is purely inseparable of exponent one 
over F (see exercise 8, p. 495). We restrict our attention also to finitely generated extensions E 


= F(a};..., ap). 

Let {a, ...,a,,} be a minimal set of generators for E/F, so m = 0 if and only if E = F. 
Suppose that m > 0. Then a, ¢ F and a; ¢ F(a,...,a;_ 1) for 1 < i < m. Since aP €e F for all i, 
the minimum polynomial of a; over F and of a; over F(a)...,a;_;) for i > 1 has the form x? — b. 
Hence [F(a,):F] = p and [F(a]... ai): F(a... a — 1) = p, which implies that [E:E] = p”. 
Evidently E = F[a;,...,a,,| and since a,? € F for all i and [E:F]= p™, the set of monomials 


(115) ata an" = 0 Ski <p, 


constitutes a base for E/F. It is clear also that E/F is a tensor product of the simple extensions 
F(a))/F. 

Put F; = F(a,..., âp... am). Then E = F’(a;) and the minimum polynomial of a; over F; has 
the form x? — bi, b; e F. By Proposition 8.17.3 we have a derivation D; of E/F, such that D;(a;) 
= 1. Thus we have D;(a,) = 6;. It follows immediately that the D; 1 <i < ra, form a base for 
DerpE as vector space over E and hence [DerpE:E] = m. Since [E:F] = p”, this implies that 
[Der -E:F] = mp”. 

We now consider any field E of characteristic p, and derivations of E into itself without 
reference to a particular subfield of E. These are the endomorphisms D of the group (E, + ,0) 
that satisfy the condition D(ab) = D(a)b + aD(b). One deduces from this that D(ca) = cD(a) if 
c is in the prime field P, so D can be regarded as a derivation of E/P. However, we shall 
simply say that D is a derivation of F into itself. Let Der E denote the set of these maps. Then 
Der E is a set of endomorphisms of the additive group (£, + ,0) having the following closure 
properties: (1) Der E is a subspace of End (E, +, 0) regarded as a vector space over E by 
defining bL for b e E, E e End(£, +, 0) by (bL)(a) = b(L(a)), (2) if D,, D> € Der E, then [D,, 
D] € Der £, and (3) if D e Der E, then D?e Der E. We shall now call any subset of End (E, + 
,0) having these closure properties a p-E-Lie algebra of endomorphisms of (E, +, 0). We use 
this terminology for want of anything better, but we should call attention to the fact that a p-F- 
Lie algebra need not be a Lie algebra over Æ in the usual sense, since the composition [D,, D>] 
is not F-bilinear. 

If F is a subfield of E such that [E: F] < œ and E is purely inseparable of exponent < 1 over 
F, then DerpE is a p-E-Lie algebra of endomorphisms of (F, + ,0). Moreover, we have seen 


that [Der FF] < œ. We shall now show that every p-E-Lie algebra of derivations of E having 
finite dimensionality over F is obtained in this way. For, we have 


THEOREM 8.45 (Jacobson). Let E be a field of characteristic p + 0, F a subfield such 
that (1) [F:F] < © and (2) F is purely inseparable of exponent < 1 over E. Then DerpE is a 
p-E-Lie algebra of endomorphisms of (F, + ,0) such that p[Der;E:E] Conversely, let Z be a 
p-E-Lie algebra of derivations of E such that [2 :E] œ% and let F be the set of 2 -constants of 
E, that is, the elements that are D-constants for every D e F. Then [E:F] < œ and E is purely 


inseparable of exponent < 1 over F. Moreover, 4 = DerpF and if (Di,..., D,,) is a base for Z 
over E, then the set of monomials 


(116) Dy Dy Dat, Oki <p, (DP =1) 
form a base for End;E regarded as a vector space over E. 


Proof. The first statement has already been proved. To prove the second, we use the same idea 
we used to establish the results on finite groups of automorphisms in fields: We use the given 
set of endomorphisms to define a set of endomorphisms L satisfying the conditions in the 
Jacobson-Bourbaki correspondence. In the present case we take L to be the set of F-linear 
combinations of the endomorphisms given in (116). Evidently Z contains 1 = D,°... D,,°, so L 
contains Eg = F; and [L:F] <p™. It remains to show that L is closed under multiplication by the 
D,. We note first that if D is a derivation in F, then the condition D(ab) = D(a)b + aD(b) gives 
the operator condition 


(117) Dag = aD + D(a) 


where ag and D(a), denote the multiplications by a and D(a) respectively. Using this relation 
we see that Di(aD," ... D,") = aD,D" ... D," + D(a)Di*! ... Dm”. Hence to prove that 
D,LCI, it suffices to show that D,D,‘! ’D,,™eL for all i and all k, such that 0 < kj p. We shall 
prove this by showing that Dp | 'D,” is a linear combination with coefficients in E of the 
monomials DF” D,,” such that 0 < k; < p and }` k; <k;,;. The argument for this is very similar 
to one we used in the study of Clifford algebras (p. 230): We use induction on `k; Xk, and for 
a given k; induction on i. We have at our disposal the formulas 


(118) DP=) ba Dy, bx € E, 
k 


and 


(119) D;D; = D;Di+ Y dinDi, dia. € E, 
k 


that follow from the conditions that 7 is closed under pth powers and under commutators. The 
result we want to prove is clear if) k, = 0, so we may suppose that >, k, > 0. Then some k # 0 


and we suppose that k; is the first of the k’s that is > 0. Then DF! ... Dy" = Dj ... Dy”. If i 
<j, then... D,/” is one of the monomials (116) for which the sum of the exponents is ¥ jk, + 1. 
Hence the result holds in this case. The same thing is true if i = 7 and k; < p— 1. Now let i=j, 
k, = p- 1. Then D,D*j ”D, = DD i" ” D_ Km and the result follows by induction if we 


replace D? by }, bD. Now assume that i > j. Then by (119), 
D:D} i Dpt" = DDD js Le. Dpi” +) din DD jh” kass Dink 


The result follows in this case also by applying both induction hypotheses to the right-hand 
side. This establishes the key result that L is closed under multiplication. Hence the Jacobson- 
Bourbaki correspondence is applicable to L, and this shows that if F = {aļagB = Bag for B e€ 


L}, then F is a subfield such that [E:F] = [L:E] and L — EndpE. By definition of L we have [L: 
EJ] < p” and equality holds here if and only if the monomials (116) form a base for L over £E. 
Since 7) generates L, the conditions defining F can be replaced by agD = Da, for all D e7. 
By (117) this is equivalent to: a is a D-constant for every Z. Hence F is the set of 7 -constants 
and ¥ c a DerpE. Then E is purely inseparable of exponent =1 over F. We have [F:F] — 
[F:F] < p”, so [E:F] = p” with m < m. On the other hand, Z) contains m linearly independent 
derivations (over F), so [Der-E: E] = m and hence [E:F] > p”. It follows that 7) — DerpF, 


[Der,E: F] =m, and [LF] = [FF] = p”. This completes the proof. D 


EXERCISES 


In these exercises we assume that EF is purely inseparable of exponent <1 over F of 
characteristic p and that Æ is finitely generated over F. 


1. (Baer.) Show that there exist a derivation D of E/F such that F is the subfield of E of D- 
constants. 


2. Let D be a derivation of E/F such that F is the subfield of D-constants. Show that the 
minimum polynomial of D as a linear transformation in E/F has the form 


"=l 
xP" + cx" + ia + Cms ‘ort = F 


where p” = [E: F]. Show that (1, D,..., D” D is a base for End pE as vector space over 
E. 


3. (J. Barsotti, P. Cartier.) Show that if D is a derivation in a field F of characteristic p # 0, 
then DP- Ha~ ! Da) = a~ | DPa — (a~ Da. 


4. (M. Gerstenhaber, m. Ojanguren-M.R. Sridharan.) Let E be a field of characteristic P # 0 
and let V be an E subspace of Der E closed under pth powers. Show that V is a Lie 
subring. Note that this shows that closure under Lie products is superfluous in the 
statement of Theorem 8.45. 


5. (Gerstenhaber, Ojanguren-Sridharan.) Extend Theorem 8.45 to obtain a 1-1 
correspondence between the set of subfields F of E such that E/F is purely inseparable 
of exponent 1 and the set of p-E-Lie algebras of derivations of E that are closed in the 


finite topology. 


8.18 TENSOR PRODUCTS OF FIELDS 


If E/F and K/F are fields over F what can be said about the F-algebra E ®,K? In particular, is 
this a field or a domain? It is easy to give examples where E®pK is not a field. In fact, if EAF 
then F ®pE is never a field. To see this we observe that by the basic property of tensor 
products, we have an additive group homomorphism 7 of E ® p E into E such that a® b ~ ab, a, 
b e E. Also it is clear from the definitions that this is an F-algebra homomorphism. Now let 
aéE,éF. Then 1, a are F-independent in E and hence 1 ®@ 1, 7 ® a, a® 1, and a® a are F- 
independent in F ®-E. Hence / ®a—a®1#0 butyl ®@a—a®@1)=a-—a = 0. Thus ker nis a 
non-zero ideal in E ®;E. The existence of such an ideal implies that F ®,F is not a field. 

It is readily seen also that if x and y are indeterminates then F(x) ®;F(y) is a domain but not 
a field (see below). 

Another fact worth noting is that if F is algebraic over F and F ®pK is a domain then this 
algebra is a field. This is clear if F is finite dimensional over F. For, then E ®pK can be 
regarded as a finite dimensional algebra over K ([F ®@;K-:K/ = [F:F]) and a finite dimensional 
domain is necessarily a field. The general case follows from this since any aeEE ®pK is 
contained in a subalgebra isomorphic to an algebra E,®;K where F)/F is finitely generated, 
hence finite dimensional over F. If a # 0 the corresponding element of Ey ®;K is invertible. 
Hence a is invertible in E®;K. 

We shall now proceed to a systematic study of tensor products of fields. In our discussion 
separability will mean separability in the general sense of Definition 8.4, pure inseparability 
of E/F will mean that £ is algebraic over F and the subfield of E/F of separable elements over 
F coincides with F. We shall say that F is algebraically closed (separably algebraically 
closed) in E if every algebraic (separable algebraic) element of E is contained in F. We prove 
first 


THEOREM 8.46. Let E/F and KIF be extension fields of F. 

(1) If E/F is separable and K/F is purely inseparable, then E ® pK is afield. On the 
other hand, if EIF T% not separable, then there exists a purely inseparable 
extension KIF of exponent l such that E ®;K contains a non-zero nilpotent 
element. 

(2) If E/F is separable algebraic, then E ®,K has no nilpotent elements for arbitrary 
KIF, and E ®,k is a field if F is separably algebraically closed in K. 


(3) The elements of E Bp k are either invertible or nilpotent if either E/F is purely 
inseparable and KIF is arbitrary, or E/F is algebraic and F is separably 


algebraically closed in K. 


Proof. In(1) and in the first part of (3) we may assume the characteristic is p # 0. 

(1) Assume E/F'is separable and K/F is purely inseparable. The separability implies 
that if a)...,a,, are F-independent elements of E then these elements are linearly 
independent over F/”° (contained in the algebraic closure of E) for every e = 0, 1,2, 
.... This implies that the elements a), ... a,,?° are F-independent for every e. Now let 
K be purely inseparable over F and let z= Xa; ® c; # 0 in EDK where a€ E, c; e K. 
We may assume that a; are F-independent and we have an e such that oP e F,l<i< 
m. Then z?° =a” O P =Y cP"; aP @ 1 0. Hence YP; aP + 0 and this element of 
E is invertible. Thus z2”® is invertible and hence z is invertible in E®,-K. Then E ®-K 
is a field. 

Next assume F/F is not separable. Then there are F-independent elements q),...,a,, 
€ E that are not F’? independent. Hence we have cP, c, € F not all 0 such that 97° 
= 0. Then = 0 but Yc ag £ 0 since not every c, = 0. Consider the field K = F(c!P a; = 
0,...5 C,/?). We have k ? F since K = F implies the a; are F-dependent. Now 
consider the element z = Ya® c,!/PEE ®,K. This is non-zero since the a; are F- 
independent and not every cj! P = (in K. On the other hand, z? = YaP@c;= Fcap @ 1 
= 0. Hence z is a non-zero nilpotent in E ®;K. 

(2) Assume E/F is separable algebraic, K/F is arbitrary. We have to show that E ®@,K 
has no non-zero nilpotents and that € ®;K is a field if F is separably algebraically 
closed in K. Using the argument at the beginning of this section, we obtain a reduction 
to the case in which £ is finitely generated, hence [E : F] < œ. In this case since F is 
separable algebraic, E = F[a] = F(a) where the minimum polynomial M(A) of a over 
F is irreducible and separable (meaning, E.g., that (m(1),m'(A)) = 1). Then E ®}K = 
K[a] where the minimum polynomial of a over K is m(A). Hence E ® pK = 
K[A]/(m(A)). Since M(A) is separable, we have the factorization in K[A] of m(A) as 
m(A) = m,(A)...m,(A) where the m = 2) are distinct irreducible monic polynomials. 
Then E ®;K = K[A]/(m(A)) = ©", K[A]/(m(4)) (exercise 4, p. 410 of BAT). Since 
K[A]/(m{A)) is a field we see that E ®;K is a direct sum of fields. Clearly an algebra 
having this structure has no non-zero nilpotent elements. This proves the first 
assertion of (2). 

The coefficients of the mA) are separable algebraic over F since they are 
elementary symmetric polynomials in some of the roots of m(A) and these are 
separable algebraic over F. Hence if F is separably algebraically closed in K then 
mA) € F[A]. Thenr = 1 and E pK = K[A]/(m(A)) is a field. 


(3) Let E/F be purely inseparable, K/F arbitrary. Let z = )))""a; ®@ c, a; © C; € K. We can 


choose e so thata}! € F, 1 <i < m. Then Z° = Jaf? O cP =189) "aff cP’ e 1® 
K. If 2° = 0 then 2° and hence z is invertible. Otherwise, z is nilpotent. 

Next let E/F be algebraic and K/F separably algebraically closed. Let S be the 
subfield of E/F of separable elements. Then £/S is purely inseparable. Now E pK = 
E @ (S pK) (Exercise 13 (iv), p. 148). Since S/F is separable algebraic and K/F is 
separable algebraically closed, S ®p K is a field by (2). Since E/S is purely 
inseparable it follows from the first part of this proof that the elements of E 8 (S ® 
pK) are either nilpotent or units. Hence this holds for E ®;K. O 


We consider next tensor products of fields in which one of the factors is purely 
transcendental. 


THEOREM 8.47. Let E/F be purely transcendental, say, E = F(B) where B is a 
transcendency base and let KIF be arbitrary. Then E ®pK is a domain and its field 


offractions Q is purely transcendental over k = | ®pK with B = B® 1 as transcendency base. 


Moreover, if F is algebraically closed (separably algebraically closed) in k then E = F(B) is 
algebraically closed (separably algebraically closed) in Q = K(B). 


Proof. For simplicity of notation we identify E£ and K with the corresponding subfields £ ® 1 
and 1 ® K of E®pK. These are linearly disjoint over F in A = E ®pK. A consequence of this is 


that if a subset S of E is algebraically independent over F then it is algebraically independent 
in A over K. It suffices to see this for S = {5,,..., s,,} In this case algebraic independence over 


F is equivalent to the condition that the monomials s ;‘...s*™, k, > 0 are distinct and linearly 


independent over F. Since this carries over on replacing F by K it follows that S is 
algebraically independent over K. In particular, this holds for the transcendency base B of E. 
Consider the subalgebra K/B/ If C is a finite subset of B then K[C] is a domain (Theorem 2.13 
of BAI, p. 128). It follows thatnK[B] is a domain and this is a subalgebra of A = F(B)®,K. Let 


z € A. Thenz = $ a;c;, a;€ F(B), c;€ K. We can write a; = pq ! 


pq | where p = $p;c; € k[B]. Conversely, if p € K[B] and q € F[B] q #0, then p and q`! € A 
so pg ! € A. It follows that A is the localization K/B] Fpj* Of K/B] with respect to the 
multiplicative monoid F[B]* of non-zero elements of F/B]. Since K/B] is a domain, its 
localization K/B/];;g)* is a domain. Moreover, this is a subalgebra of the localization 
K[B] prp)* which is the field of fractions Q of K/B] and of A. Evidently O = K(B) the subfield 
of Q/K generated by B. Since B is algebraically independent over K we see that Q is purely 
transcendental over K with transcendency base B. This proves the first assertion of the 
theorem. 

To prove the second assertion we shall show that if K(B) contains an element that is 
algebraic (separable algebraic) over F(B) not contained in F(B) then K contains an element 
that is algebraic (separable algebraic) over F not contained in F. Clearly, if such an element 
exists it exists in K(C) for some finite subset C of B. Hence it suffices to prove the result for 


where p, q € FUB]. Then z = 


finite B and then by induction, it is enough to prove the result for K(x), x an indeterminate. 
Hence suppose z € K(x) is algebraic over F(x) and z ¢ F(x). Let A" + b,A”~ 1+ & +b, be the 
minimum polynomial of z over F(x) so n > 1. Write b; = pq ~ !, p; q € F[x]. Then w = qz has 
minimum polynomial 4” + P)A"”~ ! + qpA"~ 2+... + q"~ 'P.. Replacing z by gz we may 


assume the Now write z = rs~ ! where r, s € k[x] are relatively prime. Then we have 
(120) =r" = barts + br" 25? + ++ + d,s” 


If deg s > 0 then s has a prime factor in K[x]. By (120) this is also a factor of r”, hence ofr, 
contrary to the relative primeness ofr and s. Thus s is a unit so we may assume z € K[x] so z = 
z(x) = Cg + Gxt ... + Cx”, C; E K. We claim that the c; are algebraic over F. We have the 
relation z(x)” + b,z(x)”?~! +... + b, =0 where b; = b(x) € F[x]. Hence for every a e F we 
have 


(121) z(a)" + b,(a)z(a)""* +++: + b,(a) = 0. 


Since b{a) € F this shows that the element z(a) of K is algebraic over F. If F is infinite we 
choose m + 1 distinct elements 44, a>,..., 4m + 1 in F and write 


2(a,) = Co + Cia +°°° + Cnt 


z(a) = Co + CyAy + °° + Cy Ad 
(122) 


ZA) = Co + C1Qm + °° + Cnam 


Since the a; are distinct the Vandermonde determinant det (a) # 0. Hence we can solve (122) 
for the c’s by Cramer’s rule to show that every c, is a rational expression with integer 
coefficients in the a; and the z(a;). Since the a; and z(a;) are algebraic over F it follows that 
every c, is algebraic over F. If F is finite we replace F by its algebraic closure F which is 
infinite. Then the argument shows that every c, is algebraic over F and since F is algebraic 
over F it follows again that the c, are algebraic over F. Since z ¢ F(x), some c,éF and hence 
we have an element of K that is algebraic over F and is not contained in F. 

Finally, we suppose that z is separable algebraic over F. Then F(co,...,c,,) contains an 
element not in F that is separable algebraic over F. Otherwise, this field is purely inseparable 
over F and hence there exists a pf, p the characteristic such that oe e F,0<k<m. Then m = 


cy + cP P+... CoP xm" e F(x) contrary to the separability of z over F. Thus if K(x) 


contains an element that is separable algebraic over F(x) and is not in F(x) then K contains an 
element separable algebraic over F not in F. O 


In our next result we weaken the hypothesis that E/F is purely transcendental to 
separability. Then we have the following 


THEOREM 8.48. Let E/F be separable, K/F arbitrary. Then E ®pK has no non-zero 
nilpotent elements. 


Proof. It is clear that it suffices to prove the theorem in the case in which E/F is finitely 
generated. In this case E has a transcendency base B over F such that E is separable algebraic 
over F(B) (Theorem 8.39). Then E @pK = E Orp (F(B) @pK). By the last result F(B) @pK is a 


domain. If Q is its field of fractions then E ®,(g)(F(B) FK) is a subalgebra of E ®p(g)Q. Since 

E is separable algebraic over F(B), E ®pg)Q has no nilpotent elements # 0, by Theorem 8.46 

(2). Hence E Orp (FB) ®pK) has no non-zero nilpotent elements and this is true also of E ® 
K. 0 

F . 


Next we consider the situation in which F is separably algebraically closed in one of the 
factors. 


THEOREM 8.49. Let F be separably algebraically closed in E and let K/F be arbitrary. 
Then every zero divisor of E pK is nilpotent. 


Proof. Let B be a transcendency base for K/F. Then E ®pK= (E @pF(B)) @p@)K. By the last 
result E ®,F(B) is a domain and F(B) is separably algebraically closed in the field of fractions 
Q of E @, F(B). Since F(B) is separably algebraically closed in Q and K is algebraic over 
F(B), it follows from Theorem 8.46 (3) that the elements of O ®rp)K are either invertible or 
nilpotent. Now let ze(E ®,F(B)) Ərp) K be a zero divisor in this algebra. Then z is a zero 
divisor in the larger algebra O ®xp) K. Hence z is not invertible in O @p;—)K SO Z is 
nilpotent. Since E ®pK =(E @pF(B)) @pg)K it follows that every zero divisor of E @pK is 
nilpotent. 0 


We can now prove our main result on the question as to when the tensor product of two 
fields is a domain. 


THEOREM 8.50. Let E/F and KIF be extension fields of F. Assume (1) either EIF or KIF 
is separable and (2) F is separably algebraically closed in either E or K. Then E ®p k is a 


domain. 


Proof. By the last result if one of the factors has the property that F is separably algebraically 
closed in it then the zero divisors of E ®p K are nilpotent. On the other hand, by Theorem 8.48, 


if one of the factors is separable then E ®p K has no non-zero nilpotent elements. Hence E ®; 
K has no zero divisor = 0.0 


A class of extension fields that is important in algebraic geometry (see Weil’s Foundations 
of Algebraic Geometry, American Mathematical Society Colloquium Publication v. XXIX, 
1946 and 1960) is given in the following 


DEFINITION 8.6. An extension field E/F is called regular if (1) E is separable over F and 
(2) F is algebraically closed in E. 


We remark that a separable extension field E/F contains no purely inseparable subfield. 
Hence we can replace condition (2) in the definition of regularity by: (2') F is separably 
algebraically closed in E. The sufficiency part of the following theorem is a special case of 
Theorem 8.50. 


THEOREM 8.51. An extension field E/F is regular if and only if E ®p k is a domain for 
every field K/F. 


Proof. It remains to prove the necessity of the two conditions. The necessity of separability 
follows from Theorem 8.46 (1). Now suppose F is not algebraically closed in Æ. Then Æ 
contains a finite dimensional subfield K ? F. Then E ®p k contains K ®p K that is not a field. 


Since K ® K is finite dimensional it is not a domain. Hence we have the necessity of condition 
(2).0 


One readily sees that if F is algebraically closed then any extension field E/F is regular. 
Then E ®;K is a domain for any K/F. 


In the situation in which E ®p K is a domain for the extension fields E/F and K/F we shall 
denote the field of fractions of E @; K by E - K (or E'pK). 


EXERCISES 


1. Show that if E/F is purely transcendental then E/F is regular. 
2. Show that if £,/F and E,/F are regular then £; - E; is regular. 


8.19 FREE COMPOSITES OF FIELDS 


Given two extension fields E/F and K/F, a natural question to ask is: What are the possible 
fields over F that can be generated by subfields isomorphic to E/F and K/F, respectively? To 
make this precise we define the composite of E/F and KIF as a triple (T, s, t) where T is a 


field over F and s and ¢ are monomorphisms of E/F and K/F, respectively, into T/F such that 
T/F is generated by the subfields s(£) and t(K), that is, T = F(s(£), t(K)). The composites (T, 
s, t) and (I", s’, t") of E/F and K/F are said to be equivalent if there exists an isomorphism u: T 
— I” such that the following two diagrams are commutative: 


F u 


Of particular interest for algebraic geometry are the composites that are free in the sense of the 
following 


DEFINITION 8.7. Afield composite (T, s, t) of E/F and KIF is called free if for any 
algebraically independent subsets C and D of EIF and KIF respectively, s(C) U t(D) = Ø 
and s(C) ® t(D) is algebraically independent in T/F. 


Since any algebraically independent subset can be imbedded in a transcendency base, it is 
clear that the condition that (T, s, £) is free is equivalent to the following: for every pair of 
transcendency bases B and B’ of E/F and K/F respectively, s(B) N t(B’) = © and s(B) U t(B’) 
is algebraically independent. We now observe that the word “every” can be replaced by 
“some” in this condition; for, we have 


LEMMA 1. A composite (T, s, t) of E/F and KIF is free if and only if there exist 
transcendency bases B and B' of E/F and KIF respectively such that s(D) N t(D') = © and 
s(C) Ut(D') is algebraically independent in T/F. 


Proof. The necessity of the condition is clear. To prove sufficiency, suppose we have 
transcendency bases B and B’ for E/F and K/F such that s(B) ® t(B') = Ø and s(B) U t(B’) is 
algebraically independent over F. To prove freeness of (T, s, t), it obviously suffices to show 
that if B, is another transcendency base for E/F, then s(B,) U (B^ = Ø and s(C) U t(C) is 


algebraically independent over F. Hence it suffices to show that if C is a finite algebraically 
independent subset of E/F and D is a finite subset of B’, then s(C) ® t(D) = Ø and s(C) U t(D) 
is algebraically independent over F. Now there exists a finite subset G of B such that C is 
algebraically dependent over F on G. Obviously s(G) ® t(D) = Ø and s(G) U (D) is a 
transcendency base for F(s(G), s(C), t(D))/F, so tr deg F(s(G), s(C), t(D))/F = |G| + |D|. We 
also have tr deg F(s(G),s(C))/F = tr degF(s(G))/F = |G| and tr deg F(s(C))/F = |C]. It follows 
that tr deg F(s(G), s(C), t(D))/F(s(C), t(D)) < |G| — |C] (see exercise 1, p. 517). From this and 
the above formula for tr degF(s(G),s(C),t(D))/F, we know that 


tr deg F(s(C), (D))/F > (|G| + |D|) — (|G| — |C) = |D| + |C]. 


Hence s(C) ® t(D) = Ø and s(C) U t(D) is algebraically independent over F. Hence (T, s, £) is 
free. O 


We note also that if B and B’ are transcendency bases for E/F and K/F satisfying the 
conditions of the lemma then s(B) U t(B') is a transcendency base for I/F. This is clear since 
the elements of s(E) and t(K) are algebraic over F(s(B), ¢(B')) and since T is generated by s(£) 
and ¢(K), it follows that IT is algebraic over F(s(B), t(B’)). Hence s(B) U (B^ is a 
transcendency base for T/F. 


We can use these results to construct a free composite for any two given fields E/F, K/F. 
Let B and B’ be transcendency bases for E/F and K/F. Suppose first that B and B" are finite, 
say, B = {€)....¢m}, B' = {č ...5 ,}. Let Q be the algebraic closure of the field F(x1,..., Xm 4 
n) Where the x; are indeterminates. We have monomorphisms s' and t’ of F(¢)..., ¢,,)/F and 
F(¢\,..., ¢,/F, respectively, into Q such that s'¢;=x,; 1 < i < m, and a Nyy es 
E is algebraic over F(B) and Q is algebraically closed, s’ can be extended to a monomorphism 
s of E/F into Q/F (Exercise 1, p. 475). Similarly, t! can be extended to a monomorphism ¢ of 
KIF into Q. Then it is readily seen that if [ = F(sE, tK) then (T, s, t) is a free composite of E 
and K. 

If either B or B” is infinite we modify the procedure used for B and B’ finite as follows. We 
may assume |B’| > |B|. Then let X be a set disjoint from B and B’ such that |X] = |B'|. We can 
decompose X as a disjoint union of two sets Y and Z such that | Y| = |B], |Z| = |X| = |B’|. Let 
F(X) be the field of fractions of the polynomial algebra F[x]. Let Q be the algebraic closure of 
F(X). Then, as before, we can define a monomorphism s of E/F into Q/F whose restriction to B 
is a bijective map of B onto Y and a monomorphism t of K/F into Q/F whose restriction to B’ is 
a bijective map of B’ onto Z. Then (T, s, t) for l = F(sE, tK) is a free composite of E£ and K. 

We wish to give a survey of the isomorphism classes of the free composites of two given 
fields E/F and K/F. First, we consider the composites of E and K that need not be free. We 
form E ®p K and let P be a prime ideal in this F-algebra. Then (E ®p K)/P is a domain whose 
field of fractions we denote as I’p. We have the homomorphism s„:a ~a + P (a=a ® 1) of E/F 


1 <j <n. Since 


into (E ®p K)/P and hence into I p/F. Since Æ is a field this is a monomorphism. Similarly, we 
have the monomorphism tp:b ~ b + P of K/F into I p/F. Since E ® pK is generated by E and K, 
(E p K)/P is generated by its subalgebras sp(£), tp(K), and hence Ip is generated as a field 
by Sp(£), tp(K). Thus (Tp, sp, tp) is a composite of E/F and K/F. 

We note next that distinct prime ideals P and P' of E ®p k define inequivalent composites 
(Tp, Sp, tp) and (Ts sp, tp’). If these are equivalent then we have an isomorphism u of Ip onto 
I, such that s, = us, and t,. = ut,. Then u(a + P) = us,a=S,a=art P' for a € E and u(b + 
P)=b+P' for be K. Then uÈ 7,5; + P=} fbit P = i b e K. If) "bit P=0 so} ab; + 
p' = 0 and )'a,;b; € p'. Thus P c P' and by symmetry P’ a p. Hence P = P' contrary to our 
hypothesis. 

Now let (Z; s, t) be any composite of E/F and K/F. The homomorphisms s and f¢ give rise to 
the homomorphism s ® t of E ®p K into I which maps a,b; a; € E, b; € K, into >) s(a;)t(b;) of 
T. Since the image is a domain the Kernel is a prime ideal P. Hence we have the isomorphism 
u of (E ®,K)/P onto the subalgebra s(£)t(K), which can be extended to an isomorphism u of 


the field of fractions T p of (E ®p K)/P onto I = F(s(£),t(£)). It is clear that u is an equivalence 
of composites. We have therefore established a 1 — 1 correspondence between the set of prime 
ideals of E ®p K and the set of equivalence classes of composites of E/F and K/F. 


It remains to sort out the composites (T p, Sp, tp) that are free. For this we shall prove 


THEOREM 8.52. The composite (Tp, Sp, tp) is free if and only if every element of P is a 
zero divisor in E pK. 


We shall identify E and K with their images in E ®p K so we regard E and K as subfields of 
E @p K with the property that if S and S’ are linearly independent subsets of E/F and K/F 
respectively then the map (s,s') ~ ss’ is a bijection of S x S’ with SS’, and SS’ is a linearly 
independent subset of E 8p K. 


Let B and B’ be transcendency bases for E/F and K/F, respectively, and let m and M’ be the 
submonoids of the multiplicative monoids of E and K generated by B and B’. Since B and B' 
are algebraically independent, M and M’ are linearly independent in E/F and K/F and hence 
MM' is linearly independent in E ®p K. Now the subspace FMM' spanned by MM" is the 


subalgebra F[B U E’] and also the subalgebra F[B]F[B']. Thus MM' is a base for F[B U B'] = 
F[B|F[B']. Since B U B' is algebraically independent, F[B LU B’] is a domain. The field of 
fractions F(B U B’) of this domain contains the subfields F(B) and F(B’) and it is readily seen 
that the subalgebra F(B)F(B’) generated by F(B) and F(B’) is the set of fractions a(cc’) ~ ! 
where a € F[B] F[B'] = FIPIF[B'], c e F[B] and c'e F[B']. 

We can now state 


LEMMA 2. The composite (T, Sp, tp) is free if and only if F(B)F(B') ® P = 0. 


Proof. By Lemma 1, (Tp, Sp, tp) is free if and only if's,(B) N t,(B') = Ø and sp(B) U tp(B’) is 
algebraically independent over F. Since sp(B) = {b + P|b € B} and tp(B’) = {b' + P|b' € Bt 
these conditions hold if and only if the cosets q + P q € MM’, are linearly independent, where 
mand M' are as above. Hence (T p,Sp, tp) is free if and only if FMM' N P = 0. Since FMM' = 
F[B]F[B’] this condition is F[B]F[B'] N P = 0. Since F(B)F(B’) = {a(cc')~ 'la € FIBIFTB'), c 
c F[B],c' € F[B']}, FIB]FIB'] N P=0 © F(B)F(B’)) N P = 0. Hence {I>p, sp, tp) is free if and 
only if F(B)F(B') N P=0.0 


We require also 
LEMMA 3. E®pkis integral over F(B)F(B’). 


Proof. Since B is a transcendency base for E, E is algebraic, hence, integral over F(B). A 
fortiori E is integral over F(B)F(B’). Similarly, K is integral over F(B)F(B’). Since the subset 
of E ®p K of elements integral over /(B)F(B’) is a subalgebra, it follows that E ®p K = EK is 


integral over F(B)F(B’). 0 


We can now give the 


Proof of Theorem 8.52. Suppose first that every element of P is a zero divisor in E @;K. Let a 
e P ® F(B)F(B’). Then a is a zero divisor in E ®pK. We claim that a is a zero divisor in 
F(B)F(B’). To see this let {c,} and {dB} be bases for E/F(B) and K/F(B’') respectively, and let 
{m,} and {ns} be bases for F(B)/F and F(B')/F respectively. Then {c,m,} is a base for E/F 
and {dgn,} is a base for K/F. Hence {c,dgm,n;} is a base for E ®p K over F. Then every 
element of E ®; k can be uniquely written as a finite sum }` qapCadp where qap E F(B)F(B’). 
Now we have an element } qagCadg # 0 such that > gy pCgdg = aÈ qapCadp) = 0. Since a € 
F(B)F(B’) this implies that every aq,g = 0 and since >) qagCadg F 0 some qag +o SO a is a Zero 
divisor in F(B)F(B’'). Since a e F(B)F(B') and F(B)F(B’) is a domain this implies that a = 0. 
Hence we have proved that if every element of P is a zero divisor in E ®p K then P N 
F(B)F(B') = 0 and (Ip, Sp, tp) is free by Lemma 2. Conversely, assume (Tp, Sp, tp) is free so P 
N F(B)F(B’) = 0. Let b e P. Then b is integral over F(E)JF(E') and hence we have a relation 
b” + c/b"~!+...+E,,=0 for C; € F(B)F(B’) and we may assume n is minimal. Then we have 
c, =—b"—...—bc, _,€ P since b e P. Thus c, € P N F(B)F(B’) and hence c, = 0. Then 
b(b"~!+¢,b"~*+... +c, ,) =0 and by the minimality ofn, b, 1 +c¢)b"~*+...+¢, #0 
so b is a zero divisor. Thus freeness of (T p,s,,,tp) implies that every element of P is a zero 


divisor in E N p K. This completes the proof. O 


We now consider an important special case of this theorem, namely, that in which F is 
separably algebraically closed in Æ. In this case, by Theorem 8.49, the zero divisors of E ®p K 


are nilpotent. Since the converse holds, nilrad E ®p K is the only prime ideal in E ®p K that 
consists of zero divisors. Hence we have 


THEOREM 8.53. Jf K/F is arbitrary and F is separably algebraically closed in E, then in 
the sense of equivalence there is only one free composite of E/F and K/F. 


This applies in particular if E is a regular extension of F. Then E ®pK is a domain for any 
K so nilrad E ®pK = 0. Then the free composite of E/F and K/F is E - K which was defined to 
be the field of fractions of E (x) pK. 

Finally, we consider a very simple case of composites that will be needed in valuation 
theory (see section 9.9), that in which one of the fields, say, E/F is finite dimensional. In this 


case all that we shall require of the foregoing discussion is the 1— 1 correspondence between 
the set of prime ideals P of E ®p K and the equivalence classes of composites (Tp, sp, tp). We 


note also that since E ®p K is finite dimensional over K this ring is artinian. Hence the results 


we derived in section 7.11 are applicable. Using the fact that a domain that is finite 
dimensional over a field is a field we see that the prime ideals of E (x) x F are maximal. By 


Theorem 7.13, there are only a finite number of these. Moreover, the proof of Theorem 7.13 
shows how to determine the maximal ideals. We remark also that if E is algebraic over F, E/F 
has a vacuous transcendency base. It follows from Definition 8.7 that the composites of E/F 
with any K/F are free. 

We now suppose E has a primitive element, say, E = F(0). This is always true ;if E/F is 
finite dimensional separable (BAI, p. 291). We have the following 


THEOREM 8.54. Let KIF be arbitrary and let E = F(@) where @ is algebraic over F with 
minimum polynomial F(x) of degree n. Let fi(x),... f(x) be the monic irreducible factors of 


fix) in Kix]. Then there are h inequivalent composites (T; S; t;) of E/F and K/F where I; = 
K[x]/(f(x)), s; is the monomorphism of E/F into T/F such that 0 ~ x + (fx)) and t; is the 
monomorphism b ~ b + (f(x)) of KIF into T/F. Any composite of E/F and K/F is equivalent 
to one of these. 


Proof. We have the isomorphism of E = F(@) over F onto F[x]/(f (x)) such that 0 ~ X =x + 
(f(x)). As we have seen before p. 546), E p K = K[x]/(f(x)). The maximal ideals of 


K[x]/(f(x)) have the form (f,(x))/(f(x)) where f,(x) is a monic irreducible factor of f(x) in K[x]. 
Then T; = Ex E) = ECVV fx) is a field over F. We have the 
monomorphism s; of E/F into T/F such that 0 ~ x + (f{x)) and the monomorphism t; of K/F into 
I/F such that b~ b + (f,{(x)), b € K. Clearly s(£) and t{K) generate J. Hence (T; s; t;) is a 
composite of E/F and K/F. The rest follows from what we proved before. D 


i? 


EXERCISES 


1. Determine the composites over Q of È with 
(1) Q(@), œ a primitive cube root of 1 
(2) a(v2) 
2. Determine the composites over Q of Q(. =1) and a(v3). 


3. Suppose E/F and K/F are finite dimensional Galois and E ®p K is a domain hence a 
field. Is (E ®p K)/F Galois? 
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Valuation Theory 


The valuation theory of fields that we consider in this chapter, like commutative ideal theory, 
had two sources: number theory and algebraic geometry. The number theoretic source was 
Hensel’s discovery of p-adic numbers. Hensel introduced the ring of p-adic integers 
essentially as we have done in Chapter 2: as inverse limit of the rings 2/(p”). Later Ktirschak 
(1913) introduced the concept of a real valued valuation of a field and showed that Hensel’s 
field of p-adic numbers could be viewed as the completion of Q relative to the p-adic 
valuation. The algebraic geometric source of the theory appears to have been the introduction 
of the concept of a place by Dedekind and Weber, for the purpose of providing a precise 
definition of the Riemann surface of an algebraic curve. 

Valuation theory forms a solid link between number theory, algebra, and analysis. On the 
one hand, it permits a precise study of algebraic functions, and on the other hand, it leads to the 
introduction of analogues of classical analytic concepts in the study of arithmetic questions. 

We begin our discussion with real valued valuations, which are now usually called 
absolute values. We can distinguish two types: archimedean and non-archimedean. The latter 
suggests a generalization to valuations into ordered abelian groups, a generalization that was 
introduced by Krull in 1934. In what follows we reserve the term “valuation” for Krull’s 
valuations into ordered abelian groups. Valuations in this sense are equivalent to two other 
concepts: valuation rings and places. We study in some detail the problem of extension of 
absolute values and valuations to extension fields, especially finite dimensional extensions. In 
particular, we consider this problem for the field Q. 

We define the important concept of a local field that is of central interest in algebraic 
number theory and we give a topological characterization of these fields and of finite 
dimensional division algebras over local fields. Our method of treating these is an elementary 
one that yields at the same time a determination of the Brauer group of a local field. We 
consider also quadratic forms over local fields and define an invariant for these that together 
with the discriminant provides a complete classification of these forms. 


9.1 ABSOLUTE VALUES 


We assume familiarity with the properties of R as an ordered field such as are developed in a 
beginning calculus course (e.g., completeness, existence of logarithms, etc.). The concept of an 
absolute value on a field is a direct generalization of the absolute value defined on the field €. 
This is given in 
DEFINITION 9.1 An absolute value || on a field F is a map a » |a | of F into R such that 
(1) |a|=0and|a|=0 ifand only ifa=0. 
(2) |ab|=|a||5|. 
(3)|a+b|<|a|+ |b | (triangle inequality). 


As indicated earlier, the classical example is that of € in which | a | = (a? + f?)!? for 
a=a+p./—1,o,feR. We shall use the notation | | „ for this absolute value if it is necessary to 


distinguish it from other absolute values. On the subfields R and @ this reduces to the usual 
absolute value. We proceed to list some less familiar examples. 


EXAMPLES 


1. p-adic absolute value of Q. Let p be a fixed prime in Z. If a # 0 in Q, we write a = 
(b/c)p* where k e Z and (b,p) = 1 = (cp). The integer k is uniquely determined by a. We 
denote it as v,(a) and we define v,,(0) = oo. Then it is easy to verify the following properties: 


(i) v,(a) = œ if and only ifa =0. 


(ii) v,(ab) = v,(a) + v,(b). 
(iii)v,(a + b) > min (v,(a), v,(b)). 


Now let y be a real number such that 0 < y < 1 and define a p-adic absolute value | a |, on@ by 
(1) jalp — yr”) 


(where y” = 0). Then it is clear that this is an absolute value in the sense of Definition 9.1. In 
fact, in place of the triangle inequality we have the stronger relation 


(2) |a+b| < max(|al, |b) 


for | |= | lp- 


2. Let F(x) be the field of rational expressions in an indeterminate x and let p(x) be a prime 
polynomial in F[x]. If a € F(x) and a ¢ 0, we have a = p(x)*b(x)/c(x) where k € Z and 
(p(x),b(x)) = 1 = (p(x), c(x)). We define v,(a) = k, v,(0) = œ, and | a |, = y’rP for some real y, 
0 <y <1. Then properties 9.1.1 and 9.1.2 and equation (2) hold. Hence we have an absolute 


value on F(x) defined by | a |, = y’P. 

An important special case of this type of absolute value is obtained by taking F = € and 
p(x) =x—r,r € €. An element a € C(x) defines a rational function on the Riemann sphere € U 
{oo} in the usual manner. If v,(a) = k > 0, then this rational function has a zero of order k at r 
and if v,(a) =— k, k > 0, it has a pole of order k at r. Finally if v,(a) = 0, then a has neither a 
zero nor a pole at r. Thus the value of v,(a) gives us information on the behavior of a in the 


neighborhood of the point r. 


3. We obtain another absolute value on F(x) in the following manner. If a + 0, we write a = 
b(x)/c(x) where b(x) = bo + byxt+ ... + b,x”, C(x) = co t cix +... + ¢,x", b,c, € F, bm EO, C, 
+ 0. Define v,,(a) =n — m, v,,(0) = 0, and | a |p = y"" where y €R and 0 < y 1. We have 
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Hence the definition of |a|,, amounts to using the generator x! for F(x) and applying the 
procedure in example 2 to F[x~!] with p(x!) = x71. Hence | |, is an absolute value. 

In the special case of F = €, V,,(a) gives the behavior at œ of the rational function defined 
by a. 

For any field F we have the trivial absolute value on F in which|0|=0 and|a|=1ifa# 
0. 


We now list some simple properties of absolute values that follow directly from the 
definition: 


Al ex 1, lu| = 1 if u" = 1, |—al = |a 


la~*|=|a|"* ifa#0,  llaļ-|b]|o <lļa-bl. 


An absolute value on F defines a topology on F whose open sets are the unions of the 
spherical neighborhoods where such a neighborhood of a is defined by {x| x — a| < r} for some 
r > 0. It is easy to see that this gives a topology on F and that multiplication, addition, and 
subtraction are continuous functions of two variables in the topology. We can now define 
convergence of sequences and series in the usual way. Thus we say that {a,|n = 1, 2,...} 


converges to a if for any real ¢ > 0 there exists an integer N = N(e) such that 


|a- a| < € 


for all n > N. In this case we write also lima, = a or “~“'* 7 if it is necessary to indicate | 
|). The standard elementary facts on convergence of sequences in € carry over (a, — a and b, 
— b imply a, + b, > a + b; a,b, — ab; |a,| — la| in R; if a, — a + 0, then a, # 0 for 


-1 _, a! etc.,). We define convergence of a series 


sufficiently large k, say, n > k, and then a, ,; 
Èf a and write Y a, =s ifs, — s for the sequence of partial sums s, =a; +... + a,- 

It is natural to consider two absolute values | |, and | |, as equivalent if they define the same 
topology on F. For example, if | |, and | |’ , are p-adic valuations defined by y and y' 
respectively, that is, | a |, = y’P, | a |, = y™®, then | a |= | a |,° for s = log y'/log y > 0. 
Hence any spherical neighborhood of a point defined by one of these absolute values is a 
spherical neighborhood defined by the other. Hence | |, and | |, define the same topology. This 


is the case for any field F and any two absolute values | | and | |’ =| |’ where s is a positive real 
number. We remark also that the topology defined by an absolute value | | 1s discrete if and only 
if | | is trivial: It is clear that the trivial | | defines the discrete topology. On the other hand, if | | 


is not discrete, then we have an a such that 0 < | a |< 1. Then“ 7° and the set of points {a”} is 
not closed in F, so the topology is not discrete. It is now clear that the only absolute value 
equivalent to the discrete one | | is | | itself. For nontrivial absolute values we shall now show 
that equivalence can hold only if each absolute value is a positive power of the other. 
Moreover, equivalence can be assured by a simple one-sided condition. Both of these results 
follow from 


THEOREM 9.1. Let | |, and | |, be absolute values of a field F such that | |, is not trivial 
and |a| < l1 fora € F implies that | a h < 1. Then there exists a positive real number s such 


that | h =| |\° (that is, |a h = | 45 for all a #0). Hence ||, and ||, are equivalent. 


Proof (Artin). The hypothesis implies that if |a|; < |b|,, then | a h < | b h and hence |a|,>1=|1 
|; implies that | a |, > 1. Since | |; is non-trivial, we can choose an ag such that | ag ~; > 1 and 
hence | ap ~ > 1. Now let a be any element such that | a |, > 1 and hence | a |, > 1. Let 


t = log la]; /log ldo|1. 


Then ¢ > 0 and | a |, =| do ht. We claim that Jah = |ag|2’ also. We have |aly = ag)” with t'> 0 and 
if t'# t, then there exists either a rational number rsuch that r > t and r < t' or a rational number 
r such that r < t and r > ¢’. In the first case, |dgl" < lah and in the second |ap|,” > |a|,. Our claim 


can therefore be established by showing that if r is a rational number >t, then |agly” > lah and if 
r is a positive rational number <z, then |ap|," < lah. Write r = m/n where m and n are positive 
integers and suppose first that r > t. Then |a|, < |a|™" and |a”|, < |ag™|,. Then |a”|) < ja)” and 


lal < |agl”””. Similarly, ifr = m/n < t, then |a|, > aoh”. Hence we have |a|, = laọht and 


_ log|alz _ log ali 
log |ao|, log |do| 
and 
log |a|, _ log |ao|2 
log|al; log |aohi 


Then ļah = la|; for s = log |ap|,/log |ap|, > O and this holds for all a such that |a|, > 1. Then if lah 


< 1, we have |a||, = |a"||,° and hence also Jah = |a|,°. Thus lah = |a|; and we have seen that this 
implies equivalence of the absolute values. 


We have observed that the p-adic absolute value on @ and the absolute values on F(x) 
defined in examples 1 and 2 satisfy the stronger triangle inequality that |a + b| < max(|a|,|b|). We 
call an absolute value non-archimedean if it has this property; otherwise we say that the 
absolute value is archimedean. This distinction can be settled by looking at the prime ring; for, 
we have 


THEOREM 9.2. An absolute value || of a field F is non-archimedean if and only if |\n\| < 1 
for alln e Z. 


Proof(Artin). If | | isnon-archimedean, then |a; + ... + a,|< max la. Hence |1+...+ 1] <|l|=1 
and |71| < 1 for all n € Z. Conversely, suppose this holds and let a, b e F. Then for any 
positive integer n we have 


|a+b|" = |a" +(i)a"7 tb + +b" 
< |a|" +a" *| |b] ++ |b|" 
< (n + 1) max (|aļ", |b|"). 
Hence |a + b| < (n+ 1)!” max(laļ, |b). Since lim(n + 1)!” = 1 in R, this implies ja + b| < 
max(|a|, bl). 
A consequence ofthis criterion is 


COROLLARY 1. Any absolute value on a field of characteristic p # 0 is non- 
archimedean. 


Proof. If n1 is in the prime ring and n1 £ 0, then (n1)! = 1 and hence |nl| = 1. Also |0| = 0. 


Hence || is non-archimedean, by Theorem 9.2. E 


The trivial absolute value is non-archimedean. It is clear also from Theorem 9.2 that if | | is 
non-archimedean on a subfield of a field F, then | | is non-archimedean on F. Hence we have 


COROLLARY 2. /f|| is trivial on a subfield, then || is non-archimedean. 


EXERCISES 


— 


. Show that if | | is an absolute value and 0 < s < 1, then | |f is an absolute value. Show also 
that if | | is non-archimedean, then | |f is an absolute value for every s > 0. 

2. Show that if | | is non-archimedean, then |a + b| = |a| if |a| > |b|. Show also that ifa, +... + 

a, = 0, then |a,| = |a;| for some i #7. 


3. Let F be a field with a non-archimedean absolute value | | and let f(x) =x" + a,x! +... 
+a, € F[x]. Show that if M = max(1,|a,|) then any zero p of f(x) in F satisfies |p| < M. 


4. Let || be an absolute value on E and assume that | | is trivial on a subfield F such that E/F 
is algebraic. Show that | | is trivial on £. 


5. Let || be an absolute value on F and let o be an automorphism. Show that | |, defined by 
lal = |oa| is an absolute value. Apply this to F= @%/2) and %/=") with c £ 1. In each case 
determine whether or not | |, 1s equivalent to | |. 


9.2 THE APPROXIMATION THEOREM 


In this section we prove a theorem on simultaneous approximation with respect to inequivalent 
absolute values that is an analogue of the Chinese remainder theorem in ring theory. This will 
imply a strong independence property for inequivalent absolute values. The result is the 
following 


APPROXIMATION THEOREM (Artin-Whaples). Let ||, | |, be inequivalent non-trivial 
absolute values on a field F, a}, ..., a, elements of F, € a positive real number. Then there 
exists an a E F such that 


(3) ja-a <e 1<k<n. 


We shall prove first 


THEOREM 9.3. Let | |1,. 
any k, 1 < k <n, there exists an a, € F such that 


(4) AN "h Car <1 läk 


Proof. It suffices to prove this for k = 1. Hence we have to show that there is an a such that |a|, 
> 1 and jaļ| < 1 for every l > 1. First let n = 2. By Theorem 9.2 there is a b € F such that |b|; < 1 
and |b|, > 1 and ac € F such that |c|, < 1 and |c|, > 1. Then if we put a = ch"! we have |a|, > 1 
and |a|, < 1. Hence the result holds for n = 2 and we may assume it for n — 1 >2. Then we have 


.., | |, be inequivalent non-trivial absolute values on F. Then, for 


elements b and c such that 


bei Eh <1,..., 


bln-1 < l, 
eh>1, Iele<l. 
We distinguish two cases. 


Case I. |b|,, < 1. Consider a, = b"c. We have |a,|, = lol lch. This is > 1 ifk= 1 and < 1 ifk > 
1 and r is sufficiently large. For any such r put a = a... Then (4) holds. 


Case Il. |b|, > 1. Here we take a, = b'/(1 + b")c. If2 < k < n — 1, |blk < 1 so %™.%. Then 
amO and la, < 1 for sufficiently large r. Next let k = 1 or n. Then |b|, > 1 and hence 


P+ H)=WA+b I! and ae Since |c > 1 and |c < 1, we have jal > 1 and |a,|, < 1 for 


sufficiently large r. Hence for a suitable a = a,. we have |a|, > 1, lah < 1,... lal, < 1. 


We can now prove the approximation theorem. Let | |1,...,||,, be inequivalent and non-trivial 
and let a,,...,a, be elements of F. For each k, 1 < k < n, apply Theorem 9.3 to obtain an 
element b, such that |b; |, > 1 and |b|; < 1 for all / #4. Then 


by"/(1 + by’) y l 
by"/(1 +b’) 7,9 ifl Ak. 


hy’ l F r r r “a Pj P A A Š 
Hence SAUA I and MWUHEIO for 7 + k, Then ab +a for every j, 1 <j <n. 
Hence for any e > 0 we can take 


n 


‘= y Ay, by.” /(1 + by”) 


k=1 


for a sufficiently large r and we shall have the required relations |a — a; <u, 1<k<n. LJ 


We have seen that if | |; and | |, are equivalent, there exists a positive real s such that | |, = | 


l. On the other hand, it is clear from the approximation theorem that if | |; are 


ath 
inequivalent and non-trivial, then there exists no (s1,..., 5,,) # (0,..., 0), S; real, such that 


lal}'|a feela = | 


for all a e F. For we may assume s; = 1. Then, by the Approximation Theorem, there exists an 
a; such that |a, < 1/2! and allj < 1/2! for j > 1. Then |a; — 0 and la; 1 and hence the 
foregoing relation cannot hold for all a;. 


EXERCISE 


1. Ifp is a prime, we define the normalized p-adic absolute value on 9 by |a|, = p` pa), Let 
| | be the usual absolute value. Show that I,|a[,= lal! for all a € Q in the sense that for 


a given a only a finite number of |a|, are # 1 and the product of these is lal !. This can 
also be written as IT, lal} = 1 if q ranges over the primes p € Z and o. 


).3 ABSOLUTE VALUES ON 0 and F(x) 


We shall now determine all of the absolute values on Q and all of the absolute values on a field 
F(x), x transcendental, that are trivial on the subfield F. We begin with @ and we prove 


THEOREM 9.4. Any archimedean absolute value | | on Q is equivalent to the ordinary 
absolute value | |,,. 


Proof (Artin). Let n and n’ be integers > 1 and write 
n =dptant+:+aqn, O<a<n. 
Then 
|n| < n(1+|n| ++ + |n|") <n(k + 1)max (1, |n|") 


and since n' > n*, log n' >k log n, k < log n'/log n, so 


In| <n tr + man |n|'08 n'/log n), 


Replacing n' by n” for any positive integer r we obtain 


r log n’ > 
In| <n g +] max (1, |n|” 08 ogn), 
log n 
Then 


r log n' tje | 
In'| < C a + } max (1, |n]!°8"/8"), 
0 


1/rlog(ra + b) = 0. Hence 


— 00 


For any real numbers a and b, lim,» (ra + b)!” = 1 since lim, 
the last inequality gives 


(5) In 


for any n, n' such that n, n' > 1. Since | | is archimedean, there exists an n’ such that |n'| > 1 
(Theorem 9.2). Then |n’| < jn" " so |n| > 1 for all n > 1. Then (5) holds for all n, n' > 1. 
Hence 


< max (1, In| log n'/log n) 


|n'| Ilogn < In| I /log n 


for all n, n' > 1. By symmetry, we have 


|n'| I/logn’ — |n| 1/log n 


Then log |n'/log n' = log |n|/log n = s > 0 and so |n| = n5 for all integers n > 1. It follows that |a| 


= |a|" for all a + 0 in Q and hence | | is equivalent to ||,,. 
We determine next the non-archimedean absolute values on Q. 


THEOREM 9.5. Any non-trivial non-archimedean absolute value of 9 is a p-adic absolute 
value for some prime p. 


Proof. We have |n| < 1 for every integer n. If |n| = 1 for all n + 0 in Z, then | | is trivial. Hence 
the set P of integers b such that |b| < 1 contains nonzero elements. Now P is an ideal in Z, since 
|b, + b| < max (lb; |, b2) < 1 ifb; € P and |nb| = |n| |b| < 1 ifn € Zand be P. Also P is prime 
since |n| = 1 and |n'| = 1 imply that |nn'| = 1. Hence P = (p) for some prime p > 0. Now put y = 
lp| so 0< y< 1. Ifr e Q, we can write r = p*a/b where k € Z and a, b ¢(p). Then |a| = 1 = |b| 


and |r| = y% = PP) Hence | | is the p-adic absolute value defined by y. E 


We consider next the case of F(x) and we prove 


THEOREM 9.6. Let | | be a non-trivial absolute value on F(x), x transcendental, that is 
trivial on F Then || is one of the absolute values defined in examples 1 and 2 of section 9.1. 


Proof. Since | | is trivial on F, it is non-archimedean. We distinguish two cases. 


Case I. |x| < 1. In this case, the fact that | | is non-archimedean and trivial on F implies that 
b| < 1 for every b e F[x]. Since | | 1s not trivial on F(x) we must have a b e F[x] such that 0 < 
b| < 1. The argument proceeds as in the proof of Theorem 9.5. We let P be the subset of b € 
F [x] such that |b| < 1. This is an ideal (p) where p = p(x) is a prime polynomial. Then one sees 
that | | is an absolute value determined by p(x) as in example 2, p. 539. 


Casell. |x|>1. Let b = bọ + bix + ...+ b,x", b; € F, bp, £ 0. Then |b,,"| = kf” > |bx'| for i < 
m. Hence |b| = {x/™ (exercise 2, p. 542). If we put x| = y7}, 0 < y < 1, it is easy to check that | | is 


an absolute value | |,, as defined in example 3, p. 539. 


9.4 COMPLETION OF A FIELD 


The familiar concepts of convergence, Cauchy sequence, and completeness in R and €, which 
we have also encountered in considering topologies defined by ideals in a ring (section 7.17), 
carry over to fields with an absolute value. For arbitrary F and | | we have the following 
definitions. 


DEFINITION 9.2. Let F be a field with an absolute value | |. A sequence {a,} of elements 


of F is called a Cauchy sequence if given any real e > 0, there exists a positive integer N = 
N€) such that 


(6) [am — an| < E 


for all m, n > N. F is said to be complete relative to | | if every Cauchy sequence of elements 
of F converges (lima, exists). 


As in the case of €, it is clear that ifa, — a, then {a„} is a Cauchy sequence 


We shall now show that any field F with an absolute value | | has a completion F in the 
sense that 


(1) Ê is an extension field of F and has an absolute value that is an extension of the given 


absolute value. 
(2) F is complete relative to the absolute value. 
(3) F is dense in Ê relative to the topology provided by the absolute value. 


Since the steps of the construction of Ê are almost identical with familiar ones for real 
numbers and for metric spaces, we shall just sketch these and leave the verifications to the 
reader. We begin with the set C = C(F) of Cauchy sequences of elements of F. This can be 
made into a commutative ring extension of F. If {a,}, {b,} € C, we define {a,} + {b,} = {a, 


+ b}, {a,} {b,} = {a,b,}. These are contained in C. If a e F, we let {a} be the constant 


sequence all of whose terms are a. Then (C, +, +, {0}, {1}) is a commutative ring containing 
the subring of constant sequences that is isomorphic under a ~ {a} with F. 


A sequence {a,} is called a null sequence if a, — 0. Let B be the set of null sequences. It 


is easily seen that B is an ideal in C. It is clear that B N {F} = {0} where {F} is the set of 
constant sequences {a}, a € F. Hence B is a proper ideal in C. We have the 


LEMMA. B is a maximal ideal of C. 


Proof. Let B' be an ideal of C such that B ? B and let {a,} € B’, € B, so {a,} is a Cauchy 
sequence that is not a null sequence. Then there exists a real number 7 > 0 and an integer p 
such that |a,| > for all n = p. Define b, = 1 ifn <p and b, =a, ifn = p. Then {a,} — {b,} = 
{c,} € B. Also it is easy to check that {b-!,} € C. Then {1} = {b 71} {b,} = {57l} {a,} - 


{bl} {c,} € B'. Hence B’ = C and so B is a maximal ideal. 


We now put Ê = C/B. This is a field. The canonical homomorphism of C onto Ê maps {F} 
onto a subfield that we can identify with F. Thus we identify an element a of F with the coset 
{a} + B in C/B. 


We now introduce an absolute value | | on F. Let {a,} € C. Then the inequality |la,,| — lanlo 
< |a, — a,,| shows that {| a„ |} 1s a Cauchy sequence of real numbers. Hence limja,„]| exists in R. If 
{b,,} € B, then b, — 0 and |b, |— 0. This implies that if {a',} € {a,} + B, then lim |a’,| = lim 
la„|. Hence if we define 


{dn} + Bl = lim |an 


’ 


we obtain a map of Ê into R. Since a| >0, it is clear that the values of | | on Ê are non-negative. 
Also since |a,|— 0 implies that a, — 0, it is clear that |a| = 0 for a € Ê if and only ifa = 0. It 
is straightforward to verify that | | satisfies also conditions (2) and (3) for an absolute value 
(see Definition 9.1). Hence | | is an absolute value on Ê. It is clear also that | | on Ê is an 
extension of || on F if we identify F as before with the subset of elements {a} + B in Ê. 

We show next that F is dense in Ê. This can be seen by showing that any a e Ê is a limit of a 


sequence of elements of F. In fact, ifā = {a,} + B, then it is easy to see that lima, = å. 

Now let {a,,} be a Cauchy sequence of elements of Ê. Then there exists an a„ € F such that | 
a,,—a,|< 1/2”. It follows that {a,} is a Cauchy sequence in F and lima, =a where a = {a,} + 
B. Thus Ê is complete and we have proved the existence of a completion of F. 


We now consider the question of uniqueness of the completion. More generally we consider 
two fields Ê, i = 1, 2, which are complete relative to valuations | |; and let F; be a dense 


subfield of Ê; Suppose we have an isomorphism s of F} onto F; that is isometric in the sense 
that |a|, = |sah for a e F. Then s is a continuous map of F into Ff, and since F} is dense in Fy, s 
has a unique extension to a continuous map §: of Ê; into F,. This is easily seen to be a 
homomorphism, and since 5™ is a homomorphism and s~!s = lp and ss~! = 1 F, imply s's:= 1 
fp 5's ‘1p, it follows that s: is an isomorphism. It is clear also that 5: is unique and is 


isometric. 
We state these results in 


THEOREM 9.7. (1) Any field F with an absolute value has a completion F (2) If Ê; i = 1, 2, 
is complete relative to an absolute value and F; is a dense subfield of f; then any isometric 
isomorphism of F; onto F, has a unique extension to an isometric isomorphism of F\ onto Ê 


2° 


If we take s = 1p, then this proves the uniqueness of the completion up to an isometric 
isomorphism that is the identity on F. 

If we complete @ relative to its usual absolute value | |, we obtain classically the field R of 
real numbers. On the other hand, the completion of Q relative to a p-adic absolute value |a|, = 
yo, 0<y <1, can be identified with the field @, of p-adic numbers as defined on pp. 74-75. 
To see this we first consider the closure Ê of Z in the completion @ relative to | |,. An element a 


€e “is the limit in @ of a sequence of integers a; Such a sequence may be assumed to satisfy a; 


a; (mod pì, i <j, and so determines an element of the inverse limit Z, = lim z/(p'). It is 
straightforward to verify that we have an isomorphism of ? with & mapping lim a; into the 
corresponding element of Z,. Hence we can identify these two rings. We observe next that 
since {|a|,|a € Q} is closed in R this set is identical with {|a| |a € @}. Hence given any P # 0 in 
A there exists an e € Z such that |p| = p°], so a = Bp © satisfies |a| = 1. Then a = lim a; where a; 
= b/c; and (b; p) = 1 = (c; p). Now there exists an x; € Z such that x; c; = b,mod p^). Then |x; — 
b/c; |= |p'| and so a = lim x; € E = 2 It follows that every element of @ has the form apf, a € Z 
p € <0. Evidently this implies that @ is the field of fractions of Z, so Q is the field Q, of p-adic 
numbers. 


EXERCISES 


In exercises 1—3, F is assumed to be complete relative to an absolute value | |. 


1. Show that if È a, converges in F, then lim a, = 0. Show that the converse holds if | | is 
non-archimedean. 


2. Show that if k is a natural number then 


k h k 
SMEG 
p p p 


and hence v,(k!) < k(I/p — 1) (cf. exercise 15, p. 84 of BAD. Use this to prove that the 
power series 


expz = Y ziji! 
0 


converges for all z € Q,, p + 2, such that |, < 1. 
3. Show that exp(x + y) = (exp x)(exp y) if x|, y| < 1. 


9.5 FINITE DIMENSIONAL EXTENSIONS OF COMPLETE FIELDS. 
THE ARCHIMEDEAN CASE 


One of the central problems in valuation theory is: Given a field F with an absolute value | | 
and a finite dimensional extension E/F, can | | be extended to an absolute value on £? We are 
interested also in determining all of the extensions. We shall now consider this problem for F 
complete especially in the case in which the absolute value is archimedean. We prove first a 
uniqueness theorem. 


THEOREM 9.8. Let || be a non-trivial absolute value on F such that F is complete 
relative to | | and let E be a finite dimensional extension field of F Suppose that | | can be 
extended to an absolute value || on E. Then this can be done m only one way and || on E is 
given by the formula 


(7) |a| an |Neve(a)| E”. 
Moreover, E is complete. 


Proof. Let {u;,...,u,} be linearly independent elements of E, {a„} a sequence of elements of 
the form % = Èf-ı awu where the a,; € F. Then {a„} is a Cauchy sequence if and only if the r 


sequences {a,;}, 1 < i <r, are Cauchy sequences in F. In one direction this is clear: If every 
{Qi} is Cauchy, then so is {a,}. Conversely, suppose {a,} is Cauchy. If r = 1, then clearly 
{az} is Cauchy. We now use induction onr. If {a} 1s Cauchy, then the sequence b, where b, 
= a„ —,,U,, is Cauchy and since = Li! aww, the result will follow by induction in this case. 
Now suppose {q,,.} is not a Cauchy sequence. Then there is a real ¢ > 0 such that for any 
positive integer N there exist integers p,q = N such that |a,,.— 0,,,| > £. Then there exist pairs of 
positive integers (P, qx), Pi < P2 <... q1 < q2 < ... such that lap, — Og.,| > £. Then (ap, — 


Spe) exists in F and we can form the sequence {b} where 


(8) by = (Opr = ar) ' (ap, E Ai, 

Since |ap r — Ogr| | < €! and {ap, — a, )— 0, we have b, — 0. On the other hand, 
py 

(9) b, = j3 Pkjuj+ Ur, 
j= 


so if SÈT fyura> -u and so {c,} is Cauchy. Hence, by the induction hypothesis, every 
{B;;} is Cauchy and since F is complete relative to | |, B; — Bj € F. Then taking limits in (9) 
we obtain L3-! 4+ =9 contrary to the linear independence of the u;. This proves that every 
{0,,;+ is Cauchy. It is now clear also that if a, — 0, then every a, — 0. Also, if we take (u, 
...,U,.) to be a base, then we see that E is complete. It remains to prove the formula (7). If this 
does not hold, we have an ¢= Li%™ such that |a"| # |N--a)| (r = [E:F]). By replacing a by a“! 
if necessary we may assume that |a”| < |N;/-(a)|. Put b = a"Np{ay!. Then |b| < 1 and Mb) = 
NpyAb) = 1. Since |b| < 1, b” — 0. Hence if we write "= Li Fu“, then B„; > 0 for every i. Since 
the norm map is a polynomial function from Æ to F, it is continuous. Hence B,,; > 0 implies that 


1 = N(b") — 0. This contradiction proves (7). E 


We shall determine next the fields that are complete relative to a real archimedean 
valuation. We shall see that the only possibilities are R and €. We shall need the following 
result, which is applicable in the archimedean case since these fields have characteristic 0. 


LEMMA. Let F be a field of characteristic + 2 that is complete relative to a real valuation 
|| and let E be a quadratic extension of F. Then (7) 


|a| = \Nej(a)| 1/2 


defines a valuation of E that is an extension of ||. 


Proof. E is Galois over F. Let a ~ @ be the automorphism of E/F that is not the identity. Then 
T(a) = a+ @ and Na) = aa and a’ — T(a)a + Na) = 0. If a € F, then Ma) = a? and the 
definition shows that the map a ~ |Nz,Aa)| "7 is an extension of the absolute value on F. 
Evidently |a| as defined by (7) is 20 and |a| = 0 if and only if N/a) = 0, hence if and only if a 
= 0. The condition |ab| = ļallb| follows from the multiplicative property of the norm. It remains 
to prove that ja + b| < |a|+ |b|. The multiplicative property permits us to reduce this to proving 
that la + I| < ja| + 1. Since this holds in F, we may assume that a ¢ F. Then E = F(a) and the 


minimum polynomial of a over F is x? — T(a)x + N(a) = (x — a)(x — a). Then Ma + 1) = (a +1) 
a +1) = N(a) + T(a) + 1 and we have to show that |M(a+1)| <(\Ma)|"? + 1}. This is equivalent 
to 


1+ T(a)+N(a)| <1+2|N(a@)|"?+|N(Q)]. 


Since ja + B| < jal + |B| in F, this will follow if we can show that |7(a)| < 2|M(a)|!? or, 
equivalently, |7(a)’ < 4|N(a)|. Suppose to the contrary that | NAP > 4|M(a)|. Then we shall 
show that a € F contrary to E = F(a). 


We consider the equation x? — ax + B = 0, a = T(a), B = Ma), whose roots in E are a and a. 
We shall show that this has a root in F and this will prove that a € F and will give the required 


contradiction. The given equation is equivalent to x = a — Bx"! and we have |a? > 48]. We shall 
obtain a solution of this as a limit of a sequence of elements {y,} in F. We define the y, 


recursively by 
Aj — 1 $ ume cmt! 
y1 = 7a, Yn+1 = A— By, i 


To see that this makes sense we have to show that no y,, = 0. We have l»: = 4l2!>®, so y, #0 and 
assuming |?*| 4/2] we have 


Masail = a- Brn | > [a] —[B 


“* > Jal—2)p]lal~! > [al —4]al?lal* = o 


A) 
in 


Hence every l»| >ła]> 0, so y, #0. Now Yns2— Yus: = Bias Ya "(a1 —7). Hence 
<4 
[Ya+2 — Yn+ | nS | 2 n+ 1 — Yal» 
a| 


so if we put r = 4|B\/|a? < 1 we obtain ly, +2- Yn + 1| < 7" — yıl. This inequality implies that 
{Y,} is a Cauchy sequence. Hence y = lim y, exists in F and if we take limits in the relation y, 


4+, =a- By, | we obtain y = a — By"! and y? — aß + B = 0. This completes the proof. E 


We can now prove 


OSTROWSK?’S THEOREM. The only fields that are complete relative to an 
archimedean absolute value are R and €. 


Proof (A. Ostrowski). Let F be complete relative to the archimedean absolute value | |. Then F 
contains @ and since the archimedean absolute values of @ are equivalent to the ordinary 
absolute value, the closure @ of @ in F can be identified with R and the restriction of | | to R is 
equivalent to the usual absolute value | |. If F contains an element i such that i7 = — 1, then F 
contains a subfield R(i) that can be identified with €. Moreover, by Theorem 9.8, the restriction 
of | | to € is equivalent to the usual | |,. If F contains no i such that i? =— 1, we can adjoin such 
an element to F to obtain E = F(i). By the Lemma and Theorem 9.8 the absolute value | | has a 
unique extension to an absolute value || on E£ and E is complete relative to | |. The foregoing 
considerations apply to E in place of F and if we can show that E = €, it will follow that F = R. 
Hence the proof has been reduced to showing that if F satisfies the hypotheses and, moreover, 
F contains € and the restriction of | | to € is equivalent to | |,,, then F = €. 


We proceed to show this. Suppose that F 2? € and let a e F, € ©. We consider the map x ~ |x 
-a| of€ into R. Since the topology induced by || on € is the usual one, it is easily seen that this 
is a continuous map of € into R. Let r = infly — a| for y e €. We claim that there exists a yọ € € 
such that |yọ — a| = r. First, it is clear that if we restrict the y so that |y — a| < r + 1, then inf ly — 
a| for these y is also r. Hence we consider the y € € such that ly — a| < r + 1. If y; and y, are 
two elements of this set, then ly} — y2| < 27 + 2. Hence the y such that ly — a| < r + 1 forma 
closed and bounded set in €, so by the continuity of x [x — a| we see that there is a yọ € € such 
that lyo — a| = r and hence fy — a| = {yp — a| for all y e €. Now let D = {y e €] |y — a| = r}. This 
set is non-vacuous and is closed since |x — a| is continuous. The argument used before shows 
that it is bounded. We shall now show that D is open and this will give a contradiction. The 
openness of D will be proved by showing that if y' € D, then every y such that |y’— y| < r is in 
D. If we replace a by y' — a and call this a, this amounts to showing that if la| = r and |y| < r, 


then ja — y| = r. To see this, let n be a positive integer and consider a” — y” = (a — y)(a — ey)... 
(a —s"~ !y) where ¢ is a primitive nth root of 1 (in €). Then 


[a-y] |a—ey| p |a- e" ty] = |a"—y"| 


<al"+|)|" 


„jn 
la- y|r"7' < ar(: = uM 


a" 


<r(1 +m 
z 


Since |a — e“y| > r, this gives 
7 gl 


Hence Ja — y| < r(1 + }y|"/r") and so if |y| < r, then lim | +(ly|/r)”| = 1 gives ja — y| < r. Hence |a — 
y| =r. Thus D is open as well as closed and non-vacuous. Since € is connected, D = €. Then 
for any two complex numbers y; and y>, ly; — Yo < ly; — al + fy2 — a| = 2r. Since this is 


impossible, the proof is complete. 


The extension problem for archimedean complete fields becomes trivial in view of 
Ostrowski’s theorem. If F is such a field with the absolute value | |, then either F = € or F =R. 
In the first case, F has no proper algebraic extension and in the second its only proper 
algebraic extension is C. In both cases | | is equivalent to || „ and if F = R and £E = €, the 


extension is given by formula (7) and E is complete. For future reference we state these results 
as 


THEOREM 9.9. Let F be complete relative to an archimedean absolute value | | and let E 
be a finite dimensional extension field of F. Then || can be extended in one and only one way 
to an absolute value on E and this is given by (7). Moreover, E is complete relative to this 
absolute value. 


The analogous result for non-archimedean absolute values will be postponed until after we 
have generalized this concept to that of a valuation into an ordered abelian group. We shall 
consider this generalization in the next section. After that we shall prove an extension theorem 
for valuations and then specialize this to obtain the extension theorem for non-archimedean 
absolute values. 


9.6 VALUATIONS 


A non-archimedean absolute value | | satisfies the condition |a + bK max(|al, |b). The addition 
of the reals plays no role in any of the defining conditions for such an absolute value; only the 
multiplication and order relation for the set of non-negative reals are involved. This 
observation leads to a generalization of the concept of a non-archimedean absolute value to 
that of a valuation with values in any ordered group plus 0. This generalization has important 
applications to algebraic geometry; moreover, it is more natural than the original concept, 
since this generality is just what is needed to relate valuations to internal properties of the 
field. We consider first the concept of an ordered abelian group. 


DEFINITION 9.3. An ordered abelian group is a pair (G, H) consisting of an abelian 
group G and a subset H such that 


(1) Gis a disjoint union of H O {1} UH! where H != {h "he H}. 
(2) His closed under the multiplication in G. 


Whenever there is no risk of confusion, we shall speak of the “ordered abelian group G” or 
the group G “ordered by H”. We can use H to define an order relation on the set G by 


specifying that g} > g» in Gif g; ! g, € H. It is clear that this relation is transitive. Moreover, 
the order is total ( = linear) since for any (g1, g>), g; € G, one and only one of the possibilities 
21 > 2&2, Z1 = 2&2 & > gı (which we write also as g} < g,) holds. Furthermore, the order is 
compatible with multiplication: g4 > g) implies gg, > gg, for any g. Conversely, if we have an 
abelian group G with a total order relation > compatible with multiplication, then we obtain 
the ordered group (G, H) in which H = {h|h < 1}. This gives an equivalent way of defining 
ordered abelian groups. We remark also that if (G, H) is an ordered abelian group, then so is 
(G, H '). Evidently, if g} > g, in (G, H), then g} < g, in (G, H '). We call (G, H- !) the 
ordered abelian group obtained by reversing order in (G, H). 

We have the usual elementary properties that g, > g’, and g, > g,’ imply g, g> > g'g’ and g} 
> g, implies gr !< g7 !. We note also that not every abelian group G can be ordered, that is, G 
may contain no subset H satisfying conditions 1 and 2. For example, we have this situation if G 
has torsion. For, if G is ordered, then g” = 1 in G implies g = 1. 

If G is ordered by H and G} is a subgroup of G, then G} is ordered by H, = G, N H. The 
order relation > in G} is the restriction to G, of this relation in G. If (G, H) and (G’,H’) are 
ordered abelian groups, an order homomorphism of G into G' is a homomorphism of G into G’ 
that maps H into H’. If (G,, H;), 1 < i < n, is an ordered abelian group, then H; x H, x ... x H, 
does not generally define an order on G, x G, x ... x G,. However, we can obtain an order in 
G=G, x G, X... x G, the lexicographic order (corresponding to the given indexing i ~ G;,), 
by taking H to be the set of elements of the form 


(Lis... lr- 1, Ars Jot 15-3 Gn) 


where 1 <r <n, l; is the unit of G, A, € H, and the g; are arbitrary. It is easy to check that this 
provides an order in G. In this order we have 


(415 J25-++sGn) > (gis J25+++3Gn) 


if and only if for some r, 1 <r <n, we have g) = g’,..., g._ 1 =g',.; and g, > gr. 

We have the following familiar examples: (1) (R, +, 0), the additive group of reals with the 
usual order; (2) the multiplicative group P of positive reals— here the subset H defining the 
order is the set of positive reals < 1 and the map x ~ e* of R into P is an order isomorphism; 
and (3) 2), the additive group of n-tuples of integers ordered by the lexicographic ordering. 

To define general valuations of fields we need to consider ordered abelian groups with an 
element 0 adjoined: V = G U {0}, disjoint, in which we define 00 = 0 and g > 0, 0g = 0 = g0 
for all g e G. We can now give the following 


DEFINITION 9.4. Jf F is a field and V is an ordered abelian group with 0 adjoined, then 
we define a V- valuation of F to be a map ọ of F into V such that 


(i) o(a) =0=a=0. 
(ii) p(ab) = ọ(a)ọ(b) 
(iii)p(a + b) < max(ọ(a), ọ(b)). 


If we take V to be the non-negative reals, so V = {0} U G where G is the set of positive 
real numbers, we obtain the non-archimedean absolute values defined before. 

In the applications one generally encounters valuations in a form in which the ordered 
group is written additively and the order is reversed. Then the element 0 is replaced by an 
element œ such that œ + œ = œ and œ > a, œ + a = œ for all a. The definition of a valuation 
then has the following form. 


DEFINITION 9.4’. Jf F is a field and V is an additive ordered group with œ adjoined, then 
we define an exponential V-valuation of F to be a map v of F into V such that 

G) V(a) =o =a=0. 

(11) v(ab) = v(a) + V(b). 

(iii') v(a + b) = min(v(a), v(d)). 


It is clear that this definition is equivalent to the first one we gave. 

If @ is a valuation of F (as in Definition 9.4) and, as usual, F* denotes the multiplicative 
group of non-zero elements of F, then @(F*) is a subgroup of the given ordered abelian group 
G. We call this the value group of F (relative to the given ọ). There is no loss in generality in 
replacing G by the value group ọ(F*) if we wish to do so. 

As in the special case of non-archimedean absolute values, we have o(1) = 1 for any 
valuation ọ since ọ(1) = ọ(13 = ọ(1). Also (— 1)? = (1) = 1 and since G has no elements of 
finite order # 1, we see that ọ(— 1) = 1. Then ọ(- a) = ọ(- 1)ọ(a) = ọ(a)and @(b) ! = (bY |. 
Also ọ(a + b) = ọ(b) if ọ(a)< ọ(b). 

We shall now give some examples of valuations that are not absolute values. We base the 
construction on a couple of lemmas. 

First, we need to extend Definition 9.4 to domains. If D is a domain and V an ordered group 
with 0, then we define a V-valuation of D as a map ọ of D into V satisfying conditions (1)—(111) 
of Definition 9.4. Then we have 


LEMMA 1. Let D be a domain and let ọ be a V-valuation on D. Then ọ has a unique 
extension to a V-valuation on the field of fractions F of D. 


Proof. Let ab! = cd"! for a, b, c,d € D, bd + 0. Then ad = bc and ọ(ajọ(d) = ọ(b) @(c). 
Hence e(a)o(b) | = e(c)e(d) | and ab- | ~ e(a)o(b) | is a well-defined map of F into V. 
Evidently, this is an extension of the valuation @ of D. Denoting this as @ also, then it is clear 
that (i) and (ii) hold. To prove (iii) it suffices to show that if a, b, d € D, d #0, then (ad ! + 
bd ') < max(g(ad~ '), (bd ')). This is equivalent to (a + b)o(d) ! < max(e(a)o(d) 3), 
~(b)o(d)'), which is obtained by multiplying g(a + b) < max (g(a), (b)) by o(d) |. The 


uniqueness of the extension is clear since p(ab- ') = p(a)g(b) !. E 


We now consider a field F with an exponential V-valuation v where V= G U {oo}. We form 
V' = @ U {œ} where G’ = Z ® G ordered lexicographically : (m, g) > (m', g') if either m > m' 
or m = m' and g > g'. We have 


LEMMA 2. The exponential V-valuation v can be extended to an exponential V'-valuation 
v' of F(x), x an indeterminate. 


Proof. In view of Lemma 1, it suffices to prove the statement with F(x) replaced by the 
polynomial ring F[x]. Let f(x) + 0 in F[x]. Then f(x) = x“ (ag + ayx + ... + a,x”) where k > 0, a; 
e F, and ap £ 0. Define v'(f) = (k, v(ap)) € G'. Then v’ is an extension of v (if we identify V 
with the set of elements (0, u), u € V). Conditions (i^) and (11') of Definition 9.4’ are clear. To 
prove (iii’) it suffices to assume that f is as indicated and g(x) = x(by + byx + ... + b,x” 
where / > 0, b; € F, bọ #0. We may assume also that v'(f) = v'(g). Then either k > / or k = l and 
v(ao) 2 v(bo). In the first case, v'(f + g) = (L, v(bo)) = v'(g) = min(v'(f), v'(g)). In the second 
case, v(f + g) > min(v'(/), v(g))if ag + by = 0 and v'(f+ g) = (k, V(do + bo)) = min (v'), v'(g)) 


if dg + by # 0. Hence in both cases, (i11’) is valid. L 


We can now begin with the trivial valuation .on F and apply Lemma 2 to obtain an 
exponential valuation of F(x,), x; an indeterminate, for which the value group ( = v(F(x1)*)) is 


Z. Adjoining successive indeterminates we obtain an exponential valuation of F(x, ..., X,) 
with value group Z™ ordered lexicographically. We remark that 2“) with n > 1 does not satisfy 
Archimedes’ axiom that if g, and g, are elements of Z™ with g, > 0, then there exists an integer 
n such that ng, > g>. (For example, take g; = (0, 1, ...) and g = (1, ...).) The additive group of 
real numbers does satisfy Archimedes’ axiom. Hence 2“) is not order isomorphic to a 
subgroup of this group and the valuation we have constructed is not an absolute value ifn > 1. 


EXERCISE 


1. Let (G, H) be an ordered abelian group, F a field, and let F[G] be the group algebra of G 
over F. Show that F[G] is a domain. If f= ¥ ag, a; # 0 in F and g, € G, define ọ(f) = 


min g; (in the ordering in G) and define @(0) = 0. Show that ọ is a valuation. Hence 


conclude from Lemma | that for any given ordered abelian group (G, H) there exists a 
field with a valuation whose value group is (G, H). 


).7 VALUATION RINGS AND PLACES 


There are two other concepts, valuation rings and places, that are equivalent to the concept of a 
valuation of a field into an ordered abelian group with 0. The first of these arises in the 
following way. Let ọ be a V-valuation of a field F and let 


(10) R = {ae Fl g(a) <1}. 


This is a subring of F since if a, b e R, then ọ(a — b) < max((a), ọ(b)) soa— b € R and 
e(ab)= ọ(a)ọ(b) < 1 so abe R. Also 1 e R. Now suppose that a ¢ R. Then (a) > 1 and o(a 
1) = (a) ! <1. We therefore see that R is a valuation ring (in F) in the sense of the following 


DEFINITION 9.5. Jf F is a field, a valuation ring in F is a subring R of F such that every 
element of F either is contained in R or is the inverse of an element of R. 


If R is the subring of F defined by (10), then R is called the valuation ring of the valuation 
ọọ. We shall now show that we can turn things around: Given any valuation ring R in F, it gives 
rise to a valuation. To see this, let U be the set of units of R, P the set of non-units, F* the 
multiplicative group of non-zero elements of F, R* = R N F* P* = P N F*. U is a subgroup of 
F*, so we can form the factor group G’ = F*/U. Now put 


(11) H' = {bU|be P*} c G'. 


Then (G', H’) is an ordered abelian group: Ifa € F*, either ae R or a` !e R. If both a anda ! 
e R, thena e UandaU=U=1inG. Ifae Randa '¢ R, thena e P* and aU € H'. Ifaé R, 
then a`! e P* and aU e H ~| It is now clear that we have condition 1 of the definition of an 
ordered abelian group. Now let b4, b € P*, so b; + 0 and b; is a non-unit of R. Then b, b, € 
P* and so Ħ' is closed under multiplication and (G, H’) is an ordered abelian group. Next we 
adjoin an element 0 to G’ to obtain V’ = C to obtain V = G' U {0}. We define a map ọ' of F 
into G' by 


(12) g(0))=0, g(aj=aUeG ifa#0. 


We proceed to show that this is a V’-valuation of F. Conditions (i) and (11) of Definition 9.4 are 
clearly satisfied. Also (iii) is clear if either a = 0 or b = 0. Ifa # 0 and b £ 0, then either ab- l 
e R* or ba’ ! e R*. We may as well assume the first. Then g'(ab- ') < 1 since R* = UU P*. 
Hence (a) < 9'(b). Also ab- ! + 1 € R, so ọ(ab™! + 1) < 1. Then 


(a+ b) = y'(b)p'(ab- ' +1) < ọ'(b) = max (ọ'(a), o'(b)). 


Thus ọ' is a valuation. Moreover, since ọ'(a) < 1 if and only if a € R, the valuation ring 
associated with ọ' is the given ring R. We shall call ọ' the canonical valuation of the valuation 
ring R. 

Now consider again an arbitrary valuation ọ of F into V = G U {0} where (G, H) is an 
ordered abelian group. Let R be the corresponding valuation ring and ọ' the canonical 


valuation of F determined by R. We have ọ'(a) = aU if a # 0, U as before. On the other hand, 
we have the homomorphism a ~ (a) of F* into G whose kernel is U. Hence we have the 
monomorphism ọ'(a) ~@(a) of G' into G, which is an isomorphism of G’ onto the value group 
of ọ. If bU e H' then b e P* and hence ọ(b) < 1. It follows that ọ'(a) ~ (a) is order- 
preserving. We now see that the given valuation ọ can be factored as no’ where ọ' is the 
canonical valuation determined by R and y is an order isomorphism of G’ onto the value group 
of @. 

We shall call two valuations @, and ọ, equivalent if there exists an order isomorphism n of 
the value group of @, onto the value group of ọ, such that @, = nọ. It is clear that this implies 
that ~, and @, have the same valuation rings. Moreover, we have shown that any valuation © is 
equivalent to the canonical valuation ọ' determined by the valuation ring R of @. Hence any 
two valuations @, and @, are equivalent if and only if they have the same valuation ring. 

Let R be a valuation ring in F, so we may assume that R is the subset of F of elements 
satisfying @(a) < 1 for a valuation ọ of F. Let P be the subset of R of elements b such that @(b) 
< 1. Then it is clear that P is the set of non-units of R. Now P is an ideal in R since if b,, b € 
P, then ọ(b; + b2) < max(@(b,), (ọ(b2)) < 1, and if a € R, then (6,a) = ọ(b;)ọ(a) < 1. Hence 
R is a local ring, P is its maximal ideal and R = R/P is a field. This field is called the residue 
field of R or of the valuation ọ. We have the canonical homomorphism a ~ a + P of R onto the 
field R. This leads us to introduce still another concept that is equivalent to valuation into an 
ordered group and valuation ring: 


DEFINITION 9.6. Jf F and A are fields, a A-valued place ? of F is a homomorphism of a 
subring R of F into A such that if a ¢ R, then a ! € Rand #a ')=0. 


Evidently the subring R specified in this definition is a valuation ring. On the other hand, if 
R is any valuation ring in F, then we have the canonical homomorphism of R onto the residue 
field. Moreover, if a ¢ R, then ọ(a)> 1 and g(a ')< 1 so a! is a non-unit in R. Then a~ ! e P 
and a ! + P = 0. Hence a ~a + P is a place # of F. We shall call this the canonical place of 
the valuation ring R. As in the case of valuations, if # is a A-valued place, R the associated 
valuation ring, and #’ the canonical place of R, then we have a monomorphism A of the residue 
field of R into the field A for which we have the factorization # = A2". The introduction of the 
concept of a place permits one to transfer the problem of extension of a valuation from a field 
F to an extension field to one of extending homomorphisms. We shall treat these questions in 
the next section. 

We remark that everything we have done here can be formulated also in terms of 
exponential V-valuations. If v is such a valuation, then the corresponding valuation ring is 


R = {ae F|v(a) 20} 
and the maximal ideal is 


P = {be F|v(b) > 0}. 


For our purposes it will be somewhat more convenient to work with the valuations 9. 
However, as we have indicated, the reader will generally encounter the exponential valuations 
in the applications, especially to number theory. 


— 


EXERCISES 


. Let | |, be the p-adic absolute value of @ determined by y, 0 < y < I, as in section 9.1. 


Determine the value group, valuation ring and ideals of non-units for @ and its 
completion @,. Show that the residue field is isomorphic to 2/(p). 


. Let # be a A-valued place on f with valuation ring R, # a A’-valued place on A with 


valuation ring R’. Show that the composite #’ # is a A’-valued place with valuation ring 
S={ae R|Aaje R}. 


. Let D be a factorial domain, F its field of fractions, and let p be a prime in D. Show that 


the localization Do) is a valuation ring in F. 


. Let E = F(é,, ..., Čm) the field of rational expressions in m indeterminates ¢;. Show that 


there exists an F- valued place on E such that F is contained in the valuation ring and # 
(¢;) = a; for any prescribed elements a; of F. 


1.8 EXTENSION OF HOMOMORPHISMS AND VALUATIONS 


In this section we shall prove a basic theorem on extensions of a homomorphism of a subring 
of a field. This will enable us to prove a general theorem on extensions of a valuation of a 
field to any extension field. The key to everything is the following simple result. 


LEMMA. Let R be a subring of a field F, M a proper ideal in R, a a non-zero element of F, 
R{a] and R{a '] the subrings generated by R and a and a’ |, respectively. Then either the 
ideal MR{a] is proper in R{a] or MR[{ a '] is proper in Ra "] 


Proof. Ifnot, we have m, n; e M such that 


(13) 


and 


(14) 


J 


| =m+m a+: +m 


| = n +na t: na. 


Since M is proper in R, k + l > 2 and we may assume k + / minimal. By symmetry, we may 
assume also that k >/. Then a/(1 — No) =na'-'1+.... +n, and (1 — No) = (1 — no)mo +... + 


(1 —no)m,a*, so if we substitute the expression for a'(1 — no) given in the first of these relations 
in the last term of the second one, we obtain a relation (13) witha lower k. This contradicts the 


minimality of k + / and proves the lemma. 


The following is the main result on extension of homomorphisms to places. 


THEOREM 9.10. Let Rọ be a subring of a field F and let Pù} be a homomorphism of Ro into 
an algebraically closed field A. Then & to a A-valued place on F. 


Proof. We consider the collection of homomorphisms of subrings of F into A that extend the 
given #). By Zorn’s lemma we have a maximal one defined on the subring RD Rọ. We shall 
show that R is a valuation ring, which will prove the theorem. Let M be the kernel of #. Since 
? maps into a field, R/M is a domain and M is a prime ideal in R. We claim that R is a local 
ring with M as its maximal ideal. Otherwise, we have an element b € R, é M such that b is not 
a unit. Then @(b) £ 0 and the subset of elements of Fof the formab “, a e R, k> 1, is a subring 
R' of F properly containing R. This can be identified with the localization Ræ (see p. 395). It 
follows that # can be extended to a homomorphism of #' of R’ into A. This contradicts the 
maximality of # and proves that R is local with M as its maximal ideal. Then R/M is a field 
and hence I’ = A(R) is a subfield of A. 


If R is not a valuation ring, we have an a € F such that a ¢ R and a ! ¢ R. By the lemma, 
either MR[a] F R[a] or MR[a '] F R[a ~ ']. We may assume the former. Now consider the 
polynomial rings R[x] and I'[x], x an indeterminate. The homomorphism # of R can be 
regarded as an epimorphism of R onto T. This extends to an epimorphism of R[x] onto T[x] 
mapping x ~ x. Let Z be the ideal in R[x] of polynomials f(x) such that f(a) = 0 and let Z’ be its 
image in T[x]. Then Z’ is an ideal in T[x] and if Z' = T[x], we have an f(x) = Xex’, c; € R, 
such that )'"c,a' = 0 and YAa(c))x' = 1. Then Aco) = 1 and A(C;) = 0 ifj > 0. Hence cg = 1 — mg 
and C; = m; where mọ, m; € M. The relation Yo"c,a' = 0 then implies that 1 € MR[a], contrary 
to hypothesis. Thus Z' F T[x] and Z' = (g(x)) where either g(x) = 0 or g(x) is a monic 
polynomial of positive degree. In the first case we choose any r € A and in the second case we 
choose any r € A such that g(r) = 0. This can be done since A is algebraically closed. Now we 
have the homomorphism of R[x] into A, which is the composite of the homomorphism of R[x] 
into T [x] with the homomorphism of T[x] into A, which is the identity on T and maps x into r. 
The homomorphism of R[x] into A has the form h(x) = co + cix +... ~ A(Co) + A(c)r + ... and 
it maps any f(x)e Z into 0. Hence we have the homomorphism of R[x]/Z into A such that h(x) + 
Z ~Aco) + P(c)r + .... Since Z is the set of polynomials such that f(a) = 0, we have the 
homomorphism of #’ of R[a] into A such that h(a) ~ Aco) + A(c))r + .... This is a proper 
extension of # contrary to the maximality of #. Hence Ris a valuation ring and the proof is 


complete. 


We shall now apply this result to valuations. We work first with canonical objects that are 
defined internally. Thus we suppose we have a field Fo and a valuation ring Rọ in Fy. We have 


the canonical valuation @‘9 and canonical place 2'ọ associated with Ro. If Pp is the kernel of # 
‘9, then Pp is the ideal of non-units in the local ring Rọ and the residue field Rọ/Pọ can be 
imbedded in an algebraically closed field A. Then #’g can be regarded as a A-valued place of 
Fy. Let F be an extension field of Fo. Then Rọ is a subring of F, so by the foregoing extension 
theorem, ?'ọ can be extended to a A-valued place # on F. Let R be the valuation ring in F on 
which # is defined, P the ideal of non-units in R. Let Up be the set of units in Ro, U the set of 
units in R. One easily sees that Rp = R N Fo, Po =P N Fo, Up = UN Fp. It follows that we have 
an order monomorphism s of the ordered group G'ọ = F*o/Up, (ordered by the subset of 
elements bUp, b € P *o) into the ordered group G” = F*/U such that 


(15) do Uo ~> aoU, age FR. 


Moreover, the definitions show that if ọ' is the canonical valuation determined by R, then we 
have 


(16) ~' (do) = sPo(4o), ao € Fo. 


In this sense we have an extension of the canonical valuation of Fọ to the canonical valuation 
on F. 


An immediate consequence of this result and the relation between canonical valuations and 
arbitrary valuations is the following extension theorem for valuations. 


THEOREM 9.11. Let ọọ be a valuation on Fo, F an extension field of Fo. Then there exists 
an ordered group G that is an extension of the value group of Fo and a V-valuation 9 of F 
for V= {0} UG that is an extension of Qo. 


Proof. Let Ro be the valuation ring of Øo, gq’ the canonical valuation associated with Rọ. Then 
if we apply the result just proved and the relation between pọ and g'ọ, we see that we have a V- 
valued valuation ọ on F, V= {0} UG, and an order monomorphism ¢ of the value group of @ 
into G such that g(a) = t@p(ap) for every ag € Fo. We can then identify the value group of Fo 


with its image under ¢ in G and we shall have the result stated. Ca 
In the case of finite dimensional extension fields we have the following result. 


LEMMA. Let ọ be a valuation of a field F, Fy a subfield such that [F : Fo| =n < œ. Then 
the value group of F is order isomorphic to a subgroup of the value group of Fo. 


Proof. For any non-zero a in F we have a relation of the form qa”! + o,a"2 + ... +o,a"* = 0 
where the a; are non-zero elements of Fọ and the n; are integers such that [F : Fo] =n =n, = m 


=... 2 ng. If p(aja") > p(aja"/) for all 7 + i, then o(d* a;a"i) = p(a,a’'), contrary to Xaa": = 0. 
Hence we have g((a,a"/) = g(a,a"/) for some pair (i, j) with i >j. Then o(a)": "7 = o(aja; D is 
in the value group Gy of Fo. It follows that for any a £ 0, g(a)" € Go. Since the value group G 


of F has no torsion, the map g ~ g” is an order monomorphism of G. Thus G is order 


isomorphic to a subgroup of Go. 


We can use the proof of this result and Theorem 9.11 to prove the existence of extensions of 
non-archimedean absolute values to finite dimensional extension fields. We have 


THEOREM 9.12. Let Fo be a field with a non-archimedean absolute value || and let F be a 
finite dimensional extension field of Fo. Then | | can be extended to an absolute value | | 
(necessarily non-archimedean) of F. 


Proof. The value group Gp of F is a subgroup of the multiplicative group of positive reals. By 
Theorem 9.11 we have an extension of ọọ = | | to a valuation g' of F whose value group G is an 
extension of Go. The proof of the lemma shows that if n = [F : Fo], then the map g ~ g” is an 
order monomorphism of G into Gp. Then g ~ g”! is an order monomorphism of G into the 
positive reals, which is the identity on Go. If we apply this to g’ we obtain the required 


extension | |. E 


An immediate consequence ofthis result, Theorem 9.8, and our earlier result, Theorem 9.9, 
on archimedean absolute values is the 


COROLLARY. Let F be complete relative to a non-trivial absolute value | | and let E be a 
finite dimensional extension of F. Then the map a ~ |\Ng(a)|! E 4] is an absolute value on E 


that extends || on F. Moreover, this is the only extension of || on F to an absolute value on E 
and E is complete relative to ||. 


EXERCISES 


1. Let G be an ordered abelian group written additively. Ifa € G, define |a| = a if a > 0 and 
a| = —a ifa < 0. Call a subgroup K of G isolated (or convex) if given any a € K, K 
contains every b such that |b| < a. Show that the set of isolated subgroups is totally 
ordered by inclusion. The order type of this set is called the rank of G. (See Hausdorff’s 
Mengenlehre, 3rd ed., Chapter 3.) 


2. Show that an ordered group G (written additively) is of rank one if and only if it satisfies 
Archimedes’ postulate: If a, b € G and a > 0, then there exists a positive integer n such 


that na > b. 


3. Prove that an ordered group is of rank one if and only if it is order isomorphic to a 
subgroup of the ordered group of additive reals. 


(This result shows that the real valued valuations are the rank one valuations, that is, 
the V-valuations such that V= {0} U G and G is of rank one.) 


The next two exercises are designed to indicate a proof of Hilbert’s Nullstellensatz, based 

on the homomorphism extension theorem. In these exercises we assume F algebraically closed. 

4. Let I be a prime ideal in F[x;, ..., x,]. Let E be the field of fractions of the domain F[x,, 

..., xn]/I and write ¢;=x,;+/,1<i<n, in E. Then E = F(q, ..., ¢,) and we may assume 

that {¢), ..., ¢.$, r = 0, is a transcendency base for E/F. For each i > r choose a 
polynomial 


Gig(X 1.465 X) X™ + Gig (X 4,006 XXT free + im, (X15. X) 


in FLX, ...,X,, x] such that ajo(x1, ..., x,) FO and aplčis ..., ¢.)e7" +... + aim (čj, ... Č, 
) = 0. Let g = g(x, ..., Xp) E€ FX, ..., x,], € Z. Then g(c), ..., ¢,) #0 and there exists a 
polynomial 


bo(X 15.2.5 X,)X" + 7+ + A b S PETE A 


e Flix}, ..., X, xX] such that o(¢,, ..., €,) ! is a root of bo(E,, -.. E Xp + os + bplé <- 
¢,.). Show that there exists an F-algebra homomorphism 7 of F[¢, ..., ¢,.] into F such that 
Nbo(ep -> SIT Ail, ---. č) # 0. Use the fact that y can be extended to an F-valued 
place on Æ to show that there exist elements c;, 1 <i < n, such that 


P(Cq,-065,) = 0 for all fel, 
gc: joro A # 0. 


5. Use the result established in exercise 4 and the theorem that the nil radical of an ideal is 
the intersection of the prime ideals containing it to prove Hilbert’s Nullstellensatz. 


The next two exercises are designed to prove that if f(x) and g(x) are two monic irreducible 
separable polynomials with coefficients in a complete non-archimedean field F and f(x) and 
g(x) are sufficiently close in a: sense to be defined, then the fields F[x]/(f(x)) and F[x]/(g(x)) 
are isomorphic. Let F be the algebraic closure of F. Observe that since any finite subset of F is 
contained in a finite dimensional subfield of F/F, the absolute value | | on F has a unique 
extension to an absolute value on F. 


6. Let f(x) e F[x] be monic irreducible and separable, f(x) = [],"(x — a) in F[x]. Put 6 = 
min; z-a ; — a;|. Show that iff € F and |$ — a| < & then F(a) c F(A). 


(Sketch of proof: Consider F(a, PFP). Let g(x) be the minimum polynomial of o4 
over F(£). Then we may assume g(x) = [])"(x — a), m < n. Since F is the algebraic 


closure of F(f), for any j, 1 < j < m, there exists an automorphism o of F/F(8) such that 
oa, = q. Since the extension of | | on F to F is unique |oy| = |y| for any y e F. Hence |8 — 


a| = |o(B — a4)|= |B — a| < ò. Ifm > 1, we can choose j > 1 and obtain |a; — a| > ô so |a; 
— a,|>B—a,| = |B — a| since 6 — a; = ($ — a1) + (a; — a;) this contradicts exercise 2 on 
p. 562. Hence m = 1, F(a,, P) = F(p) and F(a) c F(f).) 


7. (Krasner’s lemma). Let F, F, f(x) = x” + ax™ 1+... + a, be as in 6. Let g(x) = x” + 
b,x” | +... +b, € F[x]. Show that there exists an e > 0 such that if la; — bj < e, 1 < i < 
n, then F[x]/(f(x)) = F[x]/(g(x)); hence g(x) is also irreducible and separable. 

(Sketch of proof: Let M = max(1, |a;), 6 = min ; + (a; — aj) where the a; are the roots of 

fx) in F. Let e = min(l, 6”/(M + 1)"). Suppose the b; satisfy la; — b,| < £ and let £ be a 
root of g(x). Then |b,| < ja) + 1 <M + 1, so by exercise 3, p. 562. b| <M + 1 and Į |; "P — 
al = IB) = WB) - (B) = Ela; — b2” | < e(M +1)". Hence some |8 — aj, say, 8 — ol < 
el”(M + 1)!” < ô. Then by exercise 6, F(a) c F(B). Since [F(a) : F] = n and [F(B) : F] 
<n we have F(a) = F(P) and F[x]/((x)) = F[x]/(g@)).) 

8. (Kurschak). Prove that if F is algebraically closed with an absolute value | |, then the 
completion F is algebraically closed. 


(Sketch of proof: The case in which | | is archimedean follows readily from 
Ostrowski’s theorem. Now assume | | is non-archimedean. We show first that F is 
perfect. This is clear if char F = 0. If char F = p anda e Ê, let {a,,} be a sequence of 
elements of F such that a, — a. Then {a,!/?} is a Cauchy sequence so b = lim a,” 


exists in Ê and b? = a. Thus a!” exists for any a € Ê and Ê is perfect.) 


Suppose Ê is not algebraically closed. Then there exists a polynomial f(x) = x” + ayx”~ ! + 
... ta, E€ F[x] that is irreducible and has degree n > 1. This is also separable. Since F is 


dense in Ê, Krasner’s lemma implies the existence of a monic polynomial g(x) e F[x] that is 
irreducible of degree n in F[x], hence, in F[x]. Since F is algebraically closed this is 
impossible. 


9.9 DETERMINATION OF THE ABSOLUTE VALUES OF A FINITE 
DIMENSIONAL EXTENSION FIELD 


We return to the problem of extension of absolute values posed in section 9.5: Given an 
absolute value | | on a field F and a finite dimensional extension field E/F, determine all of the 


extensions of | | to absolute values on E. By the corollary to Theorem 9.12 we have the answer 
for F complete: There exists one and only one extension of the given absolute value and it is 
given by |a| = Npa)”, n = [E : F]. Moreoover, E is complete. 

Now suppose that F is arbitrary. Let F be the completion of F relative to | |, | | the absolute 
value of Ê extending the given absolute value on F. Let (T, s, t) be a composite of F/F and E/F 
since [E : F] =n < œ, and T =s(F)t(£), [I : s(F)] <n < œ. The absolute value || on Ê can be 
transferred to an absolute value | |, on s(F) by putting |s(a)|, = la| for a € F. This coincides with 
|| on F and s(ĦÊ) is complete relative to | |,. Since F is finite dimensional over s(F), | |, has a 
unique extension to an absolute value | |; on T. Restricting this to f(£) and defining |b|p = |t(b)|r 
for b € E, we obtain an absolute value on E that extends | |. 

Thus we have a procedure for associating with every composite (I, s, t) of F/F and E/F an 
absolute value on £ that extends | |. We shall now show that this correspondence is bijective if 
we identify equivalent composites. First, suppose the two composites (T4, S4, ¢;) and (1, s2, 
t») are equivalent. Then we have an isomorphism u of I’, onto I’, such that us, = s) and ut, = tp. 
For c € T, lu(C)lr2 defines an absolute value on T}. Ifa € F then |u(s,(a))|po = Isa) = la| = 
ls (@)|,,. Thus |u( )}p> is an extension of | |,j. The same is true of | |-;, and since the extension of | 
|l to an absolute value of T} is unique, it follows that |u( )|-> = | |p. This implies (T4, s, ¢,)and 
(T>, S2, t2) provide the same absolute value on E extending | | on F. 

Conversely, suppose the absolute values on E defined by (I), s4, t) and (T>, S2, t2) are 
identical. Then for any b € E, |t,(b) |p; = (b). Next, we observe that the closure of E, i = 1, 
2, with respect to the topology defined by | |p; contains SF) and t(E) and since the closure is 
a subalgebra it contains T; = s,(F )t(E). Thus £; is dense in T;. The map ¢,(b) ~ (b), b € E, is 
an isometric isomorphism of t; (E) onto t,(£). Hence, by Theorem 9.2(2), this map has a unique 
extension to an isometric isomorphism u of T} onto >. Evidently, ut, = t,. Moreover, since u 
is the identity on F and s,(F) is the completion of F relative to | |,;, the restriction of u to s,(F) 
is an isometric isomorphism of s(F) onto s,(F). Also the map s(a) ~ s,(a) is an isometric 
isomorphism of s,(F) onto s,(F ) which is the identity on F. It follows that this coincides with 
the restriction of u to s,(F). Hence us,(a) = s(a), a € F, and u is an equivalence of (T4, s4, 4) 
onto (T>, S2, t2). 

It remains to show that every extension of | | to an absolute value on E can be obtained from 
a composite in the manner indicated. Let | |' be an absolute value on E extending | | and let £’ be 
the completion of E relative to | |. As usual we denote the absolute value on Ê' that extends | | 
on E by | |. The field Ê' > F and E so Ê' D> FE which is a domain finite dimensional over Ê. 
Hence FE is a finite dimensional extension field of Ê. It follows that FE is complete relative 
to | |. Hence Ê' = Ê E and we have the composite (Ê', s, t) where s and ¢ are the injections of F 
and £. The given extension | |' is obtained by restricting | |’ as defined on Ê'. 

We can now apply the determination we gave in section 8.19 of the composites of two 
fields. According to this the composites of Ê and E are obtained by taking T y = (Ê ® p E)/M 


where M is a maximal ideal in Ê ®, E and lettings: a»va®@1l+M,ae F,t:b~1@b+M, 


b e E. Any composite of F/F and E/F is equivalent to one of these and the composites 
obtained from distinct maximal ideals are inequivalent. To obtain the corresponding extension 
to E of || we define ja ® 1 + M|, = la|, a e Ê. This has a unique extension to an absolute value | | 


ry MIm Then |bly = (1 @ b) + Mr, b e E, defines the extension of || on F to £. 


We can summarize our results in the following 


THEOREM 9.13. Let F be a field with an absolute value | |, E a finite dimensional 
extension of F, and F the completion of F. Then we have a bijection of the set of maximal 
ideals M of Ê @,E ( = Ef) with the set of absolute values on E extending | |. The absolute 


value | |ųy corresponding to M is given by \b|y = (1 @ b) + Mpy, b € E where Ty = (F p 
E)/M and | |p is the unique extension of the absolute value ||, on Ê ® 1 such that ja ® 1 + 
M|, la|, a € F to an absolute value on Ty. Moreover, Ty is isomorphic to the completion of 
E relative to | |y. 


Let rad Ep be the Jacobson radical of Ef = Ê®pE. Then Ef/radEf is semiprimitive 
artinian; hence 


(17) E,[rad Ey = 1000, 


where the I, are minimal ideals and are fields over Ê (p. 422). The maximal ideals of this 
algebra are the ideals M;=1, +... + i, +... +I} 1 <j <h, and I, = (Ep/rad Ef)/M,. Since 
rad Ef is the intersection of the maximal ideals of Ef, we have a 1—1 correspondence 


between the maximal ideals of Ef and those of Ef/rad Ef. Accordingly, we have A distinct 
maximal ideals M}, ..., M, of Ep and Ep/M, = (Ep/rad Ef )M, =1,. Hence by Theorem 9.13, 
we have A extensions of | | to absolute values | |), ..., | |, on E where | |; = | |y and J; is 
isomorphic to the completion Ê, of E relative to | |;. We call n; = [E; : F] the local degree of 
EIF at | |;. Then it is clear from (17) and the isomorphism Ê, = 1, that we have the formula 


h 
(18) Y nj = n-[rad Ep: F] 
1 


where n = [E : F] = [Ef : Ê]. Hence we have the inequality 


1 


for the local degrees. 


These results assume a simple concrete form if E has a primitive element: Æ = F(u). In this 
case we have 


THEOREM 9.14. Let || be an absolute value on F, F the corresponding completion of F, 
and let E = F(u) where u is algebraic over F with minimum polynomial fix) over F. Let f(x), 


..+5 f(x) be the distinct monic irreducible factors of f(x) in F [x]. Then there are exactly h 


extensions of || to absolute values on E. The corresponding completions are isomorphic to 
the fields F [x]/(Fi@)), 1 <j <h, and the local degree n; = degf;(x). 


Proof. The field E is isomorphic to F[x]/(f(x)), which has the base (1, ¥,..., ¥"7~ 4) 
where X/ = x! + (f(x)) and n = deg f(x). This is a base also for Eg/F, so Ef = F[x]/(fC)). 
Since F [x] is a p.i.d., the maximal ideals of F [x] /(f(x)) have the form (g(x))/(f(x)) where g(x) 
is a monic irreducible factor of f(x) in F[x]. Hence F[x]/(f(x)) has h maximal ideals 
(f,(x))/(f(x)). The corresponding factor is isomorphic to F [x]/(f,()). The results stated follow 


from this and from Theorem 9.13. 


Theorem 9.14 is applicable when E/F is separable, since such an E has a primitive element 
(BAI, p. 291). Moreover, in this case the derivative criterion (BAI, p. 231) shows that f(x) = 
f\(x) ... f,(x) where the f(x) are distinct irreducible factors. Hence we have the following: 


COROLLARY. Let E be a finite dimensional separable extension of a field F with an 
absolute value ||. Then [E : F] = ,!' n; where nj, ..., n, are the local degrees. 


As an example of these results, let F =0, | | the usual absolute value on 9, and let E = Q(u) 
where the minimum polynomial of u is f(x). Here Ê = R and f(x) factors in R[x] as 


f(x) = [|(x-u:) [l g;(x) 
1 1 


where the g, are irreducible quadratic polynomials. There are no multiple factors. Hence the 


number of extensions of | | to absolute values on Æ is t + s of which ¢ have local degree one and 
s have local degree two. The dimensionality of E/F is t + 2s. 


9.10 RAMIFICATION INDEX AND RESIDUE DEGREE. 
DISCRETE VALUATIONS 


In this section we introduce the important concepts of ramification index and residue degree of 
an extension of a valuation as well as the concept of discrete valuation. Discrete valuations are 
the ones whose value groups are cyclic, and these are the valuations that are of primary interest 


in number theory and in the study of algebraic functions of one variable. 

Let E be a field with a valuation p, F a subfield. Let S be the valuation ring of ọ in E, Q the 
maximal ideal of S, R and P the corresponding objects of F. Then R= S N Fand P= ON) F. 
Hence we have the monomorphism 


(20) a+P~atQ 


of R = R/P into § = S/Q. The image is (R + Q)/Q. This can be identified with R by means of the 
isomorphism (20). In this way we can regard R as imbedded in § and we can consider the 
dimensionality 


(21) f= [BR], 


which is called the residue degree of E/F relative to g. It is clear also that the value group 
~(F*) is a subgroup of the value group g(E*). Accordingly, we have the index 


(22) e = [@(E*): o(F*)]. 


This is called the ramification index of E/F relative to 9. 

We now consider non-archimedean absolute values, or equivalently, valuations whose 
value groups are contained in the multiplicative group of positive real numbers. Then S= {a € 
E | |a|< 1} and Q= {be E| |b] < 1}. We assume also that | | is non-trivial and we show first that 
if [E : F] =n < œ, then both the ramification index and the residue degree are finite; for we 
have 


PROPOSITION 9.1. ef<n=[E: F]. 


Proof. Let uy, ..., uy be elements of S such that the cosets u; + Q of S are linearly 
independent over R (= (R + Q)/Q). This means that if we have elements a; € R such that Ẹ, aju; 
Ee Q, then every a; € Q. Let bj, ..., be} be elements of E* such that the cosets |b,| |F*| are 
distinct elements of the factor group |E*\/|F*|. We proceed to show that the e;f, elements u,b; 
are linearly independent over F. If we multiply the b, by a suitable element of F*, we can bring 
these elements into Q. Hence we may assume that the bj e Q. We shall show first that if a; € F 
and Jaju; # 0, then Z a;u;| € |F*|. For if au; + 0, then some a; # 0 and we may assume that 0 # 
la,| > |a, for all a;. Then Xau; = la [Say lau and Ya, lau < 1. If ay tau) < 1, Yay lau; 


l a; e R for every i. Then the relation Ya; ! au; € Q 


e Q and since |ay laj < 1, ap 
contradicts Ihe choice of the u;. Hence |Ya; ! aju,| = 1 and Xau; = |a,| € |F*| 

Now suppose we have a relation )'; ;a,u,b; = 0 for a, € F. If there exists a j such that 
dia; F 9, then the relation };; ,a,u;b; = 0 implies that have distinct j, say, j = 1, 2, such 
that |X ia publ = [));aju;b2| # 0. Then 5a, 1 u; #0 and Japu; + 0, so XZ apu;| and XZ apu; € |F*|. 


Then |b | F*| = |b2|F*|, contrary to the choice of the 5;. Thus the relation }'a,,u,b; = 0 implies 


that )/4,;u; = 0 for every j. If we multiply the a; by a suitable nonzero element of F, we obtain 


a relation a’ u; = 0 witha’; € R and if some a’, # 0, we may assume one of the a’; € Q. This 


contradicts the choice of the u; and proves the F-independence of the e; fı elements u;b;. 
Evidently this implies that ef < n. E 


The inequality ef < n can be sharpened by passing to the completions. Consider the 
completion Ê of E, which contains the completion Ê of F. We observe first that £ and Ê have 
the same value groups. Let @ e Ê*. Since E is dense in Ê, there exists an a € E such that |a — â| 
< |å]. Then |a| = |@| and hence the value group |Ê*| = |E*|. Likewise |Ê *| = |F*|. Hence E/F and Ê/ 
F have the same ramification index. 

Now let $ and R be the valuation rings of Ê and Ê respectively, @ and P the corresponding 
maximal ideals. We have S= $ N E, O=@ N E, R= RNF, and P= Ê N F. If å e R, we have 
ana € F such that |å — a| < 1. Then å — a € P, soa e ÊN F= R. Hence Ê = R + P and we 
have the canonical isomorphism 


(23) a+P~a+P 


of the residue field R = R/P onto the residue field R/P. Similarly we have the isomorphism b + 
O ~ b+Ô of S/O onto §/@. This is a semi-linear isomorphism of S/O regarded as vector space 
over R/P onto §/@ regarded as vector space over R/P relative to the base field isomorphism 
given by (23). Hence [S/O : R/P] = [S/@ : R/P], so the residue degree as well as the 
ramification index is unchanged on passing to the completions. On the other hand, [E : F] =n is 
replaced by the local degree n = [Ê : Ê] and this will generally be less than [E : F]. We have 
the sharper inequality ef < ñ <n. 

Considerably sharper results can be obtained if | | is discrete. A valuation g is called 
discrete if the value group of ọ is cyclic. If this group is not trivial, then it is infinite cyclic and 
hence it can be realized in the multiplicative group of positive reals. Thus g can be regarded as 
a non-archimedean absolute value. The p-adic absolute values on @ are discrete since |p| is a 
generator of the value group. Also the absolute values of F(x) that are trivial on F are discrete 
(p. 566). We recall also that if F is a finite dimensional extension of F, then the value group of 
E is isomorphic to a subgroup of the value group of F (p. 582). Hence ifa valuation is discrete 
on F, then any extension of it to a valuation on a finite dimensional extension field is also 
discrete. 

Suppose that | | is discrete and non-trivial on F and let c be a generator of |F*| such that c < 
1. Let z be an element of F* such that |z| = c. Then z € P. Ifa € F*, a| = eÉ for k € Z, so |an 
‘= 1] and ar * = u is a unit in R. Then a = uz". Evidently a € R if and only if k > 0 and a € P 
if and only if k > 0. Hence P = zR. It is easily seen that the only ideals in R are the ideals p* = 
wR, k> 0. Hence the valuation ring of a discrete | | is a principal ideal domain. Moreover, this 
property is characteristic, since we have the following 


PROPOSITION 9.2. Let R be a local p.i.d. (= local ring that is a principal ideal domain). 


Then R is a valuation ring in its field F of fractions such that the corresponding valuation is 
discrete. 


Proof. Let rad R = P= aR. Since R is a p.i.d., R is factorial and since zR is the only maximal 
ideal in R, z and its associates are the only primes in R. Hence every non-zero element of R 
can be written in one and only one way in the form wz* where u is a unit and k > 0. Then every 


non-zero element of F can be written in one and only one way in the form vz! where v is a unit 
in R and / e Z. It follows that R is a valuation ring in F and the corresponding valuation is 


discrete. 


In view of Proposition 9.2 and the result preceding it, it is natural to call a commutative 
ring R having the properties stated in the proposition a discrete valuation ring (often 
abbreviated as DVR). As we shall see in Chapter 10, these rings play an important role in 
multiplicative ideal theory. 


We shall now prove 


PROPOSITION 9.3. Let F be complete relative to a discrete valuation | |, E a finite 
dimensional extension field of F, | | the unique extension of || to E. Then ef=n=[E: F]. 


Proof. We know that || is discrete on £ and E is complete. Let P, R, Q, S be as before and let 
z and [| be chosen so that P = zR, O = [[S. Then |z| and |] [| generate |F*| and |E*| respectively. 


Since [|E*]:|F*|] = e, we have |z| = |[]|f and hence z = uf [° where u is a unit in S. Let wy, ..., uy 
be elements of S such that the cosets u; + Q form a base for S/O over R/P. We shall show that 
the ef elements 


(24) ull, 1<i<f, 0<j<e-l, 


form a base for E/F. It is clear that the cosets [}/|F*|, 0 <j < e — 1, are distinct elements of 
|E*/|F*|. Hence the proof of Proposition 9.1 shows that the elements u; f are linearly 
independent over F. It remains to show that every element of E is a linear combination of the 
u;[ ¥ with coefficients in F. We shall show first that every element of S can be written as Dey 


aju] Y with a; € R. Letv e S. Then j| = IŻ] for some k > 0. We can write k = m;e + j} where 
m> 0 and0 <j; <e-l1. Then [v| = [J]? */1| = IF], so v = wx |], where w| = 1, so w € 
S. Then there exist a; € R such that w — J `'a;u; € Q, that is, |w — )a,,u,|< 1. Hence if we put 


vj =0— 3 au) n™ 0 EL 
i 


then v; = (w — Ya,u)a™[ and |v,| < W| = TF = [2 771. Then v = $b; up! + v, where b; 
= qm"! € R. We repeat this process with v; and obtain V, = X bpu;l J2 + vz where bp € R and 


v| < |v, |. We substitute this in the expression for v and iterate this process to obtain 


f ie} 
v=) J culty, melon 

i=1 j=0 
where |v| > |v,| > |v] > ... and the ch”) €e R. Then v,, — and Ye uF — v, Since F is 
complete and the ef elements uJ are linearly independent, the proof of Theorem 9.8 shows 
that lim c; = C} exists for every i, j. Since |c”),,| < 1 we have |c,| < 1, so cy € R, Hence v = 
Xcu; f with every C; € R. If v' is any element of E, we can multiply it by a non-zero element 
of F to obtain a v with |v| < 1. Then applying the result that we proved for v we see that v' is an 


F-linear combination of the u;Į ¥. This completes the proof. 
We can now prove the main result on residue degrees and ramification indices. 


THEOREM 9.15. Let F be a field with a non-archimedean absolute value | |, E an 
extension field of F such that [E : F] =n < œ, and let | |,, ..., | |, be the extensions of | | to 


absolute values on E. Let e; and f; be the ramification index and residue degree of EIF 
relative to ||,. Then Yef,;<n and >'ef,=n if||is discrete and E/F is separable. 


Proof. Let Ê; be the completion of E relative to | |; so Ê; D Ê, the completion of F relative to | |. 
Then we have seen that if n; = [Ê; : Ê], then ef; < n; and Yn; < n. Hence Yef; < n. We have 
seen also that n; = n if E/F is separable and n; = ej; if | | is discrete. Hence Vef; =n if | | is 


discrete and E/F is separable. E’ 


EXERCISES 


1. Determine the value group, valuation ring and its ideal of non-units for the field Q, of p- 
adic numbers. Show that the residue field is isomorphic to Z/(p). (See exercise 1, p. 560.) 


2. Let R be a valuation ring in F. Show that if R is noetherian, then R is a discrete valuation 
ring. 


9.11 HENSEL’ S LEMMA 


We shall prove next a reducibility criterion for polynomials with coefficients in a valuation 
ring, known as Hensel’s lemma. In the text we treat the most important case of this lemma in 


which the valuation is discrete and the polynomials are monic. The general form of the lemma 
is indicated in the exercises. The proofs given are due to D. S. Rim. The key result for these 
considerations is the following 


PROPOSITION 9.4. Let F be complete relative to a discrete valuation ||, R the valuation 
ring of | |, P its maximal ideal, and R = R/P. Suppose that fix) is a monic irreducible 
polynomial in R[x]. Then the image J (x) € R[x] is a power of an irreducible polynomial in R 


[x]. 


Proof. Let E be a splitting field over F of f(x). Then | | has a unique extension to an absolute 
value | | on Æ. Let S be the valuation ring determined by the absolute value | | on Æ, Q the 
maximal ideal of S. Leta € E, o e Gal E/F. Then Ng;p(a) = Nga). Hence by (7), |oa| = |a|. It 


follows that o(S) = S and o(Q) = Q. Hence o determines the automorphism 
(25) å> ca 


of S/Q. Evidently this is the identity on R = R/P, so ë e Gal S/R where § = S/Q. Now the 
given polynomial f(x) is irreducible in F[x]. This is clear since R is a p.i.d. and hence is 
factorial (BAI, p. 148). We have the factorization f(x) = [],"(« — 7;) in E[x]. Let a, = (0) = 
[C r). Then Ngr(r) = (C 1)"a,)° where e = [E : F]/n. Since a, € R, it follows that |r| < 1 
and the r; € S. Applying the canonical homomorphism of S onto § we obtain J (x) = [](x — R). 
Let f, % be any two of these roots. Since f(x) is irreducible in F[x], we have an automorphism 
o Gal E/F such that or; = r,. Then ar: =", which implies that ë. and ' have the same minimum 


polynomial over R. If this is #(x), then J (x) is a power of (x). E 
We can now prove the following version of 


HENSEL’ S LEMMA. Let F be complete relative to a discrete valuation | |, R the valuation 
ring, P the ideal of non-units, R = R/P. Suppose that f(x) is a monic polynomial in R[x] such 
that Í (x) = I(x) E(x) where F(x), §(x) are monic and (F(x), 5(x)) = 1. Then f(x) = g(x)h(x) in 
R[x] where g(x) and h(x) are monic and (x) = ¥(x), h(x) = E(x). 

Proof. We have the factorization f(x) = [])°f,(x)°! in R[x] where the f(x) are the distinct monic 
irreducible factors. By Proposition 9.4, J (x) = 9,x)*! where 9,(x) is monic and irreducible in 
R[x]. Then /(x) = [],°9,(x)*! and since 7(x), 5(x)) = 1, we may assume that 7(x) = IOA 
and 5(x) = JJS, 1 9(x)°* where 9 (x) + @1(x) for the j and / indicated. Put g(x) = [ HHO, AQ) 


=[T,+1/:@)%. Then g(x) and h(x) satisfy the required conditions. 


Hensel’s lemma can often be used to conclude the existence of roots of equations f(x) = 0 in 
R from the existence of roots of J (x) in R. The precise result giving this is the 


COROLLARY. Let F, R, P, f(x) etc. be as in Hensel’s lemma. Suppose that J (x) has P as a 
simple root in R[x]. Then f(x) has a root r in R such that F=P . 


This is clear, since we have J (x) = (x — P )5(x) where 5(? ) #0 so (x -Ë , 5(x)) = 1. 
Hensel’s lemma can also be used in conjunction with Theorem 9.14 to determine the 


number of extensions and local degrees of a p-adic valuation of @. This is illustrated in the 
second exercise below. 


— 


EXERCISES 


. Show that x° = 4 has a root in @., the field of 5-adic numbers. 


. Determine the number of extensions and local degrees of the p-adic valuation of @ for p = 


3, 5, 11 to the field of fifth roots of unity. 


. Show that Q, has p-distinct pth roots of 1. 


The next three exercises are designed to prove a general Hensel’s lemma. 


. Let R be a valuation ring in the field F. Call a polynomial f(x) € R[x] primitive if some 


coefficient of f(x) is a unit. Prove Gauss’ lemma that the product of two primitive 
polynomials in R[x] is primitive. Show that any non-zero polynomial in F[x] can be 
written as a product cg(x) where c € F* and g(x) is a primitive polynomial in R[x]. Show 
that c and g(x) are determined up to a unit in R. Show that if g(x) is primitive and 
irreducible in R[x], then g(x) is irreducible in F[x]. 


. Let F be a field, and R a valuation ring in F such that if E is any finite dimensional 


extension field of F, then there is a unique valuation ring S of E containing R. (Note that 
this holds if F is complete relative to the non-archimedean absolute value || and R is the 
valuation ring of | |.) Prove that Proposition 9.4 holds for monic polynomials in R[x]. 


. (Hensel’s lemma.) Assume that F and R are as in exercise 5. Let f(x) € R[x] and assume / 


(x) = 7(x)8(x) in R[x] where Fx), &(x)) = 1 and degi(x) > 0. Then f(x) = g(x)A(x) in R[x] 
where f(x) = 7(x), h(x) = 5(x) and deg g(x) = deg 7(x). 


. Let F be a field and let F? be the set of maps of @ into F. If fe F®, define Supp f= {ae Q 


Wa) 4 0}. Let P(F) be the subset of F? off such that 


(i) Supp fc Zm ! for some positive integer n (depending on f), 

(ii) Supp f is bounded below. 
The elements of P(F) can be represented as formal series Jia e Suppr)" these are 
called Newton-Puiseaux series. If f, g e P(F), define f+ g and fg by (f+ g)(a) = f(a) + 
g(a), (f2)(@) = X'p +y=a/B)g(y) (which is well defined). Define 0 by 0(a) = 0 for all a 


and 1 by l(a) = 1 if a = 0 and l(a) = 0 if a # O. Verify that (P(F), +, .,0, 1) is a 
commutative ring. Show that any finite subset {f}, ..., f.} of P(F) is contained in a 
subring isomorphic to a field of Laurent series J; > 4 a;t'. Hence show that P(F) is a 


field. 

Let v(0) = œ and v(f) for f# 0 be the least rational in Supp f. Verify that o(fg) = o(f) + 
v(g), of + g) = min (v(f), v(g)). 

Show that if à is a positive rational number, then the map 7, of P(F) defined by (qf) 
(a) = AA la) is an automorphism of P(F). (Symbolically, 7, : Yf(a)t% ~ ¥ flat.) We 
have vo(7 ; f) = (f). 


8. (Newton-Puiseaux.) Prove that if F is algebraically closed of characteristic 0, then P(F) is 
algebraically closed. (Sketch of proof, following an exposition by S. Abhyankar: We have 
to show that if g(x)e P(F)[x] is monic of degree n > 1, then g(x) has a factor of degree m, 
1 <m < n. By applying a suitable automorphism we may assume g(x)e F((t))[x] and by 


replacing g(x) =x" + fix"~!+...+f,,f € FO), by g(x — (I/n)f;) we may assume f} = 0. 
Suppose first that all of the f} € F[[t]] and for some f, f,(0) £ 0, that is, o(f;) = 0. Let #(x) 
=x" + f5(0)x”~2 + ... +f,((0)). Note that 9(x) e F[x] is not a power ofa linear factor and 
hence apply Hensel’s lemma to show that if n > 1 then g(x) has a factor of degree m, 1 < 
m <n. Now assume that g(x) is arbitrary of the form x” + fx” 72 +... + fp fe F(a). We 
may assume some f; # 0. Let u = inf {o(f/ij2 < i < n} and letr, 2 <r <, n, satisfy o(f,.)/r = 
u. Apply the automorphism y, to the coefficients and follow this with the automorphism of 
P(F)[x], which is the identity on the coefficients and sends x ~ tx. Multiplying the 


resulting polynomial by A °”) we obtain a monic polynomial satisfying the conditions 
considered at first. The validity of the result for this implies the result for g(x).) 


9.12 LOCAL FIELDS 


A field F with an absolute value | | is called a local field if the pair (F, | |) satisfies the 
following conditions: 


(1) || is non-archimedean discrete and non-trivial. 
(2) F is complete relative to | |. 
(3) The residue field of | | is finite. 


Typical examples are Q, and the field /((x)) of formal Laurent series in one indeterminate over 


a finite field F. For the first, this was indicated in exercise 1, p. 561. The residue field here is 
z/(p). The second example can be regarded as the completion of the field F(x) of rational 
expressions in x where the absolute value is | |,, as defined on p. 539. The valuation ring R of 


F(x) is the set of rational expressions x*b(x)/c(x) such that b(0) + 0, c(0) # 0, and k > 0. The 


maximal ideal P of R is the set of these elements with k > 0. The map sending an element of R 
into its value at 0 is a homomorphism of R onto F whose kernel is P. Hence R/P = F. It follows 
that the residue field of the completion *(*) = F((x)) is also isomorphic to F, so F((x)) is a 
local field. 

Properties (1), (2), and (3) carry over to a finite dimensional extension field; hence if F is 
local and F is a finite dimensional field over F, then E is local. 

It is quite easy to determine all of the local fields. We prove first the 


LEMMA 1. Let F be a local field and let |R/P| = Np. Then R contains Np distinct roots ©, 
G25 «+19 OND of xù» = x and these elements constitute a set of representatives of the cosets of P 


in the additive group R. 


Proof. Since Np is the cardinality of the finite field R = R/P, Np is a power of the 


characteristic of R and R is a splitting field over the prime field of x» — x (see BAI, p. 287). 
Let Cy be any element of R. Then by the Corollary to Hensel’s lemma (p. 593), there exists a ¢ 


€ R such that YP = Cand (+ P= Cp. If Co # G is another element of R and ¢ € R satisfies CNP 
=¢, C +P = Co, then ¢+ P 4 ¢ +P. Hence we can obtain Np elements C), G, ..., CN» such that C 


NP =¢ ; and the cosets C ; + P are distinct. Ld 


Put A= {C), bz ---» Gy, Let a be an element such that zR = P. Then |z| is a generator of the 
value group |F*|. More generally let z, be an element such that |z;| = |‘, k € Z. In particular, 


we can take m, = z*. Leta € F*. Then we claim that we can write 
(26) A= Ak My T Akti Mkt. 


where the a; € A, ky < k < ..., and a # 0. Let |a| = |z;,,|. Then ao” ie € R, ¢ P so there exists 
7 =| = = = 

an o,, #0 in A such that ao” 4 = a} (mod P). Then |a — o mp |< lal. If a = ag, m, we have 

(26). Otherwise, we repeat the argument with a — Oy, Tg- By induction we obtain k, < ky <... 


and a% , O, --- non-zero in A such that 
Ja| > |a— o T | > |a— oy Tk, — tkk] > 0°. 


Then we obtain (26). It is clear also that the a’s such that (26) holds are uniquely determined 
and that a € Rif and only if k> 0. 

Now let Fọ be a subfield of F such that (1) Fy > A, (2) Fo is closed in the topology of F, 
and (3) Fy N P #0. Put Ry = R N Fo, Po =P N Fo. We shall call a polynomial f(x) = x” + bx” ~ 
l+... +b, € Ro[x] an Eisenstein polynomial in Ro[x] if the b; € Py and b; € ¢ P*o. Then we 
have 


LEMMA 2. Let Fo be a subfield of F such that (1) Fy D A, (2) Fo is closed in the topology 
of F, and (3) Po = Fo N P #0. Let x be an element of R such that P = (x). Then F = F(T) and 
a is algebraic over Fy with minimum polynomial an Eisenstein polynomial over Ry = R N Fo. 


Proof. Itis clear that Fp is a local subfield of F. Now let Pp = (zp) (in Ro), P = (x). Then || = 
x|? for e > 1. Ifk e Z, we have k= eq +r where 0 <r < e —1. Hence if m, = m/z’, then [r] = | 
7*|. It follows as above that any a e F* can be written as in (26) using these z;, and that if a € 
R, then we have k > 0 in (26). We can rearrange the terms of this sum and obtain 


(27) A= aytaynt+*++a,-\n°"! 


where each a; has the form Xa, zo! and for a € R the summation is taken over the q = 0 so the 
a; € Ro. Now |ajr'| has the form af? *'. Hence jar] # lav if i #j and O < i, j < e — 1. It 
follows that if Ya’ = 0, then every a; = 0. Thus (1, z, ..., 7° 7 ') is a base for F/F and hence 
F = F(z). Moreover, applying (27) to a = n? we see that m is algebraic with minimum 
polynomial of the form xë + b,x°~ 1+... +b, where the b; € Ro. Then Nrypo(z) = +b, and |b,| = 
IN riro(2)| = |2°| = To (see (7)). Hence b, € Po, € Po”. Suppose one of the b; éP}. Then (x) = å 
o(x)x’ in (Ro/Po)[x] where j > 1 and */90(%), It follows from Hensel’s lemma that fx) is 
reducible. This contradiction shows that every b; € Py and hence f(x) is an Eisenstein 


polynomial. E 


We can now prove 


THEOREM 9.16. The local fields are either fields of formal Laurent series Fo((x)) where 
Fy is a finite field or the finite algebraic extensions of fields of p-adic numbers. 


Proof. Assume first that F is of finite characteristic. Then R/P has the same characteristic. Let 
A= {C1, G, «> Gyp}, the set of elements of F such that ¢Np = ¢.. Since Np is a power of the 
characteristic of F, it follows that A is a finite subfield of F. Let z e P satisfy P = (x). Then 
we have seen that every element of F has the form }/;., aja , a E A this expression is unique. It 
follows that F = A((z)). Now assume that F is of characteristic 0. Then F contains @ and the 
valuation is non-trivial on Q since R/P is finite, so Z N P40. Hence F contains Q, for some p. 
Let A be as before and let Fy = Q(A). Then Fọ satisfies the conditions of Lemma 2, so F = 


F(z) is algebraic over Fo. Then F is algebraic over Q,. E 


We shall now analyze the structure of a finite dimensional extension field Eof a local field 
F. Then £ is also local. Let e and f be the ramification index and residue degree of the absolute 
value || of E extending the given || on F. Then ef= n. We shall show that £E is built up from F in 


two stages: E> WD F where W is unramified over F in the sense that the ramification index 
associated with W/F is 1, and E is completely ramified over W in the sense that the associated 
residue degree is 1. 


Let R, P, S, O have the usual significance (p. 568). Then [S/Q : R/P] =f, so 
(28) No = |S/Q| = |R/P|! = Np’. 
Moreover, the Galois group of S/Q over R/P is cyclic with generator 
(29) a~ a”. 


Let A, and Ap denote the set of roots of xo = x and x? = x in E and Frespectively. Then Ap 
D Ap. Put W= F(A p). Then we have 


LEMMA 3. [W: F] =f and WIF is unramified. 


Proof. We have the isomorphism ¢ ~ Ẹ = ¢ + QO of the group A*; of non-zero elements of Ap 
onto the multiplicative group of the field S/Q. Hence if ¢ is a primitive (No — I)-st root of unity 
in S, then $ is a primitive (No — 1)-st root of unity of S/Q. We have W= F(C) and S/Q = (R/P)( 
©). Let 9o(x) be the minimum polynomial of Ẹ over R/P, so deg g(x) = [S/Q: R/P] = f and x™o 
— x = 9 (x)ho(x) in (R/P)[x]. By Hensel’s lemma, x0 — x = g(x)A(x) where g(x) and A(x) are 
monic in R[x] and 9(x) = fax), hox) = ho(x). If g(C) # 0, then h(C) = 0 and hy($ ) = 0, which is 
impossible since (f) = 0 and x%o — x has distinct roots. Hence g(¢) = 0. Since 9 (x) is 


irreducible in (R/P)[x], g(x) is irreducible in R[x] and hence in F[x]. Then g(x) is the minimum 
polynomial of ¢ over F and hence [W : F] = deg g(x) = f. Since Ag c W, it is clear that the 


residue degree of the valuation of Wis f and since [W : F] = f, it follows that the ramification 


index is 1. Thus Wis unramified. 


Now Wis a splitting field over F of the polynomial xe — x, which has distinct roots. Hence 
W is Galois over F. Let G = Gal W/F. As in the proof of Proposition 9.4 (p. 573) anyo € G 
determines an automorphism? : @ ~ sa, a € S, contained in Gal (S/Q)/(R/P). Now o maps A*E 
into itself and ë maps the multiplicative group of S/Q into itself. Since the map a ~ a4 of A*; is 
injective and W = F(A p), it follows that o ~ ë is a monomorphism. Since |G| = [W : F] = [S/Q 
: R/P] = |Gal(S/Q)/(R/P)|, it follows that o ~ ë is an isomorphism of the Galois group of W/F 
onto the Galois group of S/Q over R/P. The latter is cyclic. Hence Gal W/F is a cyclic group. 

Since W is unramified, |W*| = |F*| and hence [|E*|: |W*|] = e. Thus e is the ramification index 
of E over W and this is also [E : W]. Hence E is completely ramified over W. By Lemma 2, if z 
is chosen so that (J [) = Q, then E = W([]) and the minimum polynomial of Į | over W is an 
Eisenstein polynomial. If E is unramified, then clearly E = W. It is easily seen also that in any 
case W contains every unramified subfield of E/F. We have proved the following results. 


THEOREM 9.17. Let F be a local field, E a finite dimensional extension field of F Then E 
contains a unique maximal unramified subfield W. We have [W:F] = f, the residue degree of 
E over F, and W is a cyclic field over F. E is completely ramified over W and [E: W] = e, 
the.ramification index. Moreover, if || is chosen so that Q = (] [), then E = W({]) and the 
minimum polynomial of | | over W is an Eisenstein polynomial (over SM W). 


There is one other important result in these considerations. This concerns the existence of a 
distinguished generator of Gal W/F for W unramified. We have such a generator @~4** for the 
Galois group of the finite field S/O over R/P. We have a corresponding automorphism in Gal 
W/F. This is called the Frobenius automorphism of W/F. It is characterized by the property 
that it maps any ÑE Ar into ¢* 


EXERCISES 


1. Let the notations be as in Theorem 9.17. Show that if the characteristic of S/Q is not a 


divisor ofe, then II can be chosen so that its minimum polynomial has the form x°—z, z € 
P. 


2. Let E, F be as in Exercise 1, with char S/Q | eand assume that E/F is Galois. Show that 
Gal E/W is cyclic of order e. Hence show that Gal E/F is an extension of a cyclic group 
by a cyclic group. Show also that if E/F is an abelian extension field, then e|(Ng — 1). 


3. Show that if R is a valuation ring, then any Eisenstein polynomial in R[x] is irreducible. 


9.13 TOTALLY DISCONNECTED LOCALLY COMPACT DIVISION 
RINGS 


In this section we propose to show that local fields and more generally finite dimensional 
division algebras over local fields have a simple topological characterization: These are the 
non-discrete totally disconnected locally compact topological division rings. Using 
topological methods, we shall also determine the structure of these division rings. These 
results will be used in the next section to determine the Brauer group of a local field. 

By a topological division ring D we mean a division ring that is a Hausdorff space in which 
subtraction and multiplication are continuous maps of the product space into the underlying 
space D and the map x ~ x! is continuous on the subspace D* of non-zero elements. We 
assume that the topology is not discrete. It is easily seen that a topological division ring is 
either connected or totally disconnected, which means that the only connected subsets are the 
points. We recall that a space is locally compact if every point has a compact neighborhood. 
The locally compact fields, both in the connected and totally disconnected cases, were 
determined by D. van Dantzig in 1931. The connected locally compact division rings were 


determined in 1932 by L. Pontrjagin to be one of the classical trinity: # (Hamilton’s 
quaternions), €, and R. The totally disconnected ones were determined by the author in 1936, 
assuming the first countability axiom. This condition and the hypothesis that x ~ x! is 
continuous were removed by Y. Otobe in 1945. In this section we begin by proving the first 
countability property of locally compact division rings by a simple argument given in the 
second edition of Pontrjagin’s Topological Groups. After this we follow the method of our 
1936 proof with some improvements. Pontrjagin’s book can serve as a reference for 
topological definitions and results, which we shall state without proofs.* 

Let D be a locally compact division ring (not discrete) and let C be a compact subset of D, 
W a neighborhood of 0. Since a0 = 0 for every a and multiplication is continuous, for any x € 
C there exist neighborhoods U, of x and V, of 0 such that U,V. = {uvu Ee Up ve Vp CW. 
Since C is compact, a finite subset Uk l Uy} covers C. If we put "= Mi Vx then we have CV 
c W. Next let U be a compact neighborhood of 0. Then U is infinite, since D is not discrete. 
Let {b, } be an infinite sequence of distinct elements of U and put B, = {bm =k). Then B, > 
B, >... and NB, = ¢. Hence some B, is not closed and we may assume that B = {b,,} is not 
closed. Let b €e B, the closure of B, b ¢ B. Then b is a limit point of B and b € U. Using a 
translation we may assume b = 0. We now claim that the set {Ub,} is a base for the 
neighborhoods of 0, that is, given any neighborhood W of 0 there exists an n such that Ub, c 
W. Since U is compact, the result we proved first shows that there exists a neighborhood V of 0 
such that UV c W. Since 0 is a limit point of B, there exists a b, in V N B. Then Ub, c U c W. 
This proves the first countability axiom of D and permits us to base the topological 
considerations on convergence of sequences. 

The concept of a Cauchy sequence in a topological abelian group is clear: {a,} is Cauchy 
if given any neighborhood U of 0 there exists an integer N such that a,,— a, € U for all m, n > 
N. Such a sequence converges (to a limit) if a subsequence converges. From this it follows that 
any locally compact abelian group is complete, that is, every Cauchy sequence converges. For, 
if {a,} is Cauchy and U is a compact neighborhood of 0, then every ay,, — ay € U for N 
sufficiently large and p = 1, 2,... A subsequence of {ay + p — anp = 1, 2,...} converges and 
since this sequence is Cauchy, we have a b such that lim ,_,.. (ay + p` 4y) = b. Then lim, o dy, 
=qay tb. 

Again let D be a locally compact division ring and let U and V be compact neighborhoods 
of 0 such that V F U. Then there exists a neighborhood W of 0 such that WU c V and hence any 
w € Wsatisfies the condition 


(30) wu cV. 


We shall now show that if w is any element satisfying (30), then w” — 0. Let u be a non-zero 
element of U. Since w’U c U,w” e Uw! for n = 1, 2, 3,..., and Uw! is compact. Since the 
power sequence {w”} is contained in a compact set, to prove w” — 0 it suffices to show that 0 
is the only limit point of ‘"), that is, if w’* — z for a subsequence {w”*} of {w”} then z = 0. 


Suppose not. Then there exists a subsequence {wx} where the m, are differences n; | — n 


such that w”k — 1. Then lim wu = u for any u € U. Since every wu € wU and wU is 
closed, u = lim wu e wU. Then UC wU c J, contrary to V F U. 
We shall write a, — œ if no subsequence of {a,,} converges. We shall now use the fact that 


W contains elements w # 0 such that w” — 0 to prove 


PROPOSITION 9.5. Jf D is a locally compact division ring, then a sequence of non-zero 
elements {a„} satisfies a, — © if and only if ap! — 0. 


Proof. Since we are assuming continuity of x + x"! in D*, a, — a # 0 implies that a, `! > a 
l. Hence it suffices to show that we cannot have both a, — œ and a,~! — oo. Suppose this is 
the case and let U be a compact neighborhood of 0. By dropping some terms at the beginning, 
we may assume that a, ¢ U and a,~! ¢ U for n = 1, 2,.... Let w be an element such that w + 0 


and lim w” = 0. Suppose that for a fixed j we have infinitely many n such that a„w' € U. Since 
U wÏ is compact, this would imply that a subsequence of {a,,} converges, contrary to the 
hypothesis. Hence for every j there exists an n, such that a,w ¢ U for n> n;. This implies that 
there is a subsequence {b,} of {a,} such that b,w” € U. Since w” — 0, for each n there exists 
ak, > n such that b,,w'» € U but b,w*n*! © U. A subsequence of {b,w*» + !} converges, so we 
may assume that b,w*» *! — z. Thus and b,w*» > zw! and zw! £ 0 since b„w*» € U. Then 


wb | — wz! and b, ! = wn(w nb!) — 0. This contradicts a,~! — oo. Z 


Let G be a locally compact totally disconnected abelian group. Then G contains a base for 
the neighborhoods of 0, which are compact open and closed subsets of G. (See Pontrjagin, loc. 
cit. p. 87.) We can now prove 


PROPOSITION 9.6. Jf Gis a locally compact totally disconnected abelian group, then the 
set of compact open (hence also closed) subgroups of G form a base for the neighborhoods 
of 0 in G. 


Proof. Since 0 has a base of compact open and closed neighborhoods, it suffices to show that 
if U is such a neighborhood then there exists an open subgroup of G contained in U. Let V = U 
M—U where —U= { — ujue U} and let H= {h € Gļh + V c V). Then H is an open subgroup of G 


contained in V and hence H c U. 


Next we prove the existence of compact and open subrings of a totally disconnected locally 
compact division ring. 


PROPOSITION 9.7. Let D be a totally disconnected locally compact division ring and let 
H be a compact and open subgroup of D. Then the set R= {a € D\jaH c H} is a compact and 


open subring of D. 


Proof. It is clear that R is a subring and R contains an open neighborhood of 0, so R is open 
and hence closed. If h is a non-zero element of H, then R c Hh7!. Since Hh“! is compact, it 


follows that R is compact. 


We now let R be any compact and open subring of D and we investigate the arithmetic 
properties of R. We observe first that if b is a non-zero element of R, then bR and Rb are open 
right and left ideals of R. If J is any nonzero right (left) ideal of R, then ! = UserR(UnesR), so T is 
open. Then R// is compact and discrete and hence R// is finite. Evidently this implies that R is 
left and right noetherian, and if B is any ideal 40 in R, then R/B is finite and hence is left and 
right artinian. We note also that the set of non-zero ideals of R is a base for the neighborhoods 
of 0. For if U is any compact neighborhood of 0, there exists a neighborhood V of 0 such that 
RVR = {ava'a, a'e R, ve V} CU. If we take z to be a non-zero element of V N R, we see that 
RzR is a non-zero ideal of R contained in U. We can now prove 


THEOREM 9.18. Let R be a compact open subring of the totally disconnected locally 
compact division ring D. Then R is a local ring and if P = rad R, then R/P is a finite field 
and P= {b e R\b” — 0}. Moreover, {P"|n = 1, 2,...} is a base of the neighborhoods of 0 in 
D. 


Proof. Let P= {b e R|IbR F R}. The argument preceding Proposition 9.5 shows that b” — 0 
for every b e P. If N = R — P, then N = {a e RjaR = R} and N is a subgroup of the 
multiplicative group D* of D since N is a submonoid of D* and every element of N has a right 
inverse in N. It follows that if a € R and b € P, then ab and ba eP. Now let b,,b, € P and 


suppose that a = b, + b, ¢ P. Then c} + c, = 1 for c;=b,a! € P. The fact that the set of ideals 
of R is a base for the neighborhoods of 0 and that c,” — 0 implies that 1 + c>, 1 + c3 + ¢3?,... is 
a Cauchy sequence. Hence 1 + c, +c,” + ... exists in R and this element is the inverse of 1 — c, 


= c}. Then ctc] = 1 € P, contrary to the definition of P. Then b; + b, € P ifb; e P and we 
have shown that P is an ideal in R. Since the elements of N = R — P are units in R, it is clear 
that this set is the group of units of R, so P is the set of non-units. Since P is an ideal, R is a 
local ring and P is its radical. Since P is an ideal, P is open and R/P is finite. Since R/P is a 
division ring, R/P is a field, by Wedderburn’s theorem (BAI, p. 453). We have seen that the set 
of ideals # 0 of R is a base of the neighborhoods of 0. Hence the last statement of the theorem 
will follow if we show that if B is an ideal £ 0 in R, then there exists an integer n such that P” 
c B. We have P > B since P is the only maximal ideal of R. Now P/B is the radical of R/B, 


which is a finite ring. Since a finite ring is artinian, its radical is nilpotent. Hence (P/B)” = 0 


for some integer n. Then P” c B. 


Now let R; be a compact and open subring of D, P} its radical, and let R) = {a € DilaP, € 


Pı}. Then R, is a compact open subring of D containing R, and its radical P) > P,. Continue 
this process to define an ascending sequence of compact subrings R; c R, C ... with radicals 
P; c P) c.... Put R = UR, This is an open subring of D. We claim that R/R} is finite. 
Otherwise, we have a set of distinct cosets a, + Rj, a, E R, n = 1, 2, 3,.... Suppose a, — a. 
Then a — R since R is open, hence closed. Since P} is a neighborhood of 0, there exists an N 
such that a„— a € P) for all n > N. Thena,,—a, € P} for m,n = N. Since P; C R; this implies 
that a„ t R; =a, + R}, contrary to the assumption that the cosets a, + R, are distinct. Thus the 
sequence {a,} does not converge and since we can replace this sequence by a subsequence, 
we see that a, — œ. Hence a, ~! — 0 and so a, | € P} for n sufficiently large, say, n > N. 
Since R = UR, we have an m such that ay € R,,. Since ay! € P} C pp we have 1 = apay ! € 
P This is impossible and so we have proved that R/R} is finite. We have R)/R; C R,/R, C... 
c R/R, and since R/R} is finite we have R = R,, for some m. We have therefore proved the 
existence of a compact open subring R of D such that if P is the radical of R, then 


(31) R = {ae D|aP c P}. 


(31) R= {ae D|aP c P}. 
We now prove 


THEOREM 9.19. Let R be a compact open subring of D such that (31) holds for P the 


radical of R. Then R is a valuation ring in the sense that ifa € D,¢ R, then d'e R. Risa 
maximal compact open subring of D and is the only subring having these properties. 


Proof. Ifa € R, there exists a b} € P such that ab, ¢ P. If ab; R, we repeat the process. 
Eventually we obtain n elements b; € P such that ab, ... b, € R, € P. Otherwise, we get an 
infinite sequence {b,} c P such that ab, ... b, € R. On the other hand, {P”} is a base for the 
neighborhoods of 0, so lim b; ... by = 0, which implies that lim ab, ... b = 0. This contradicts 
ab, ... b; É R for all k. Thus we have an n such that ab; ... b, E€ R, €P. Then ab,...b, =uisa 


unit in R anda! = b} ... b,w!e PCR. Hence R is a valuation ring in D. We have shown also 


that if a ¢ R, then a! e P, and since b” — 0 for every b e P, a” > wifaé R. Ifu € R, ¢ P, 
we have u” * œ and u” + 0. In fact, since R — P is a compact set, every sequence of powers 
{uk} contains a convergent subsequence with limit in R — P and hence # 0. It is clear from 
these results on powers that P= {b e D|b” — 0}, R= {a e Dia” +œ}, D—R= {ae Dia" > 
oo}. If R'is any compact open subring of D, then R' N(D — R)=¢ so R' CR. This shows that R 


is maximal and it is the only maximal compact open subring of D. 


We can use the valuation ring R to define an absolute value on D (defined as for fields). We 
have NP” = 0. Hence given any a + 0 in R, there exists a k > 0 such that a e P*, a ¢ PK *! 
where we put P? = R. Then we define v(a) = k. If a ¢ R, then a“! e P and we define v(a) = 


—v(a~'). Also we define v(0) = œ and |a| = c” where c is a fixed real number such that 0 < c < 
1. Then we have 


THEOREM 9.20. | | is a non-archimedean absolute value on D: |a| = 0, |a| = 0 if and only if a 
= 0, ja + b| < max((al, |b), |ab| = Jal | bl. 


Proof. It suffices to prove the corresponding statements on the map v from D into Z, namely, 
v(a) = © if and only if a= 0, v(a + b) > min (v(a),v(b)), v(ab) = v(a) + v(b). The first is clear 
from the definition of v. Let a,b € D satisfy v(a) = v(b) =k>0.Ifab-!e P, a e Pb c PEt! 
contrary to v(a) = k. Hence ab“! ¢ P and similarly ba“! ¢ P. Then ab“! €e R — P and v(ab“') = 
0. If v(a) = v(b) = k < 0, then v(a’) = v(b-!) = —k > 0 and again v(ab“!) = 0. Now choose z € 
P, ¢ P®. Then x € P* for k>0, so v(a*) > k. If v(t) = 1 > k, then ™ = Lh? b? where b0) e P 
and every b,) ¢ P?, so v(b,”) = 1 = v(x). Then ™ ' = Eabh) b and 1b, © R- P. Hence 
v(a% !) > k — 1. This leads to the contradiction that v(m) > 1. Thus v(z*) = k. This and our 
earlier result show that if v(a) = k + œ, then there exists a u € R — P such that a = ux. We 


observe next that the characterization of the sets P, R — P, and D — R by the properties of the 
power sequences of the elements in these sets shows that these sets are stabilized by inner 
automorphisms of D. Now let a + 0, b #0, and let v(a) = k, v(b) = I > k. Then a = un“, b = va! 
where u and v are units in R. We have a + b = (u + vr! ™ *)ak and u + val~* € R, so v(a + b) >k 
= min(v(a),v(b)). We have ab = ua* va! = uv'x* * ! where v' = a*vr* e R-P. If k + 1> 0, we 
have seen that z% +! e Pkt! é pk*+!*1. Then the same relations hold for ab, so v(ab) = k + I = 
v(a) + v(b). This is clear also if k + 7 = 0. On the other hand, if k + 7 <0, then (aby! = ba~! = 
wE% * Dw with We R — P. Then v((ab)!)= (k + 1) and again v(ab) = k + 1 = v(a) + v(b). 
Hence in all cases v(ab) = v(a) + v(b). We therefore have v(a + b) > min(v(a) v(b)) and v(ab) 
= y(a) + v(b) ifa + 0 and b £ 0. These relations are evident if either a = 0 or b = 0. Hence we 


have the required relations for all a, b. 


The definition of | | shows that the spherical neighborhood of 0 defined by |a| <c*, k > 0, is 


P*, Hence it is clear that the topology defined by the valuation is the same as the given 
topology on D. It follows also that D is complete relative to | |. 


Let F be a closed subfield of D such that Pp = PM F # 0. Then the absolute value | | is non- 
trivial on F and F is complete. Moreover, if Rr = R N F, then Rp/Pp is isomorphic to a 
subfield of R/P. Hence Rp/Pọp is finite and F is a local field. Then the results of section 9.12 
are available. 

Let p be the characteristic of R = R/P. Then |R| = q = p” for m= 1. Let Cy be an element of R 
such that Zo = ¢) + P is a primitive (q — 1)-st root of 1. We shall now show that R contains an 
element ¢ such that C7~ ! = 1 and Z = Zo. Then ¢ will be a primitive (q — 1)-st root of unity in R. 
Now ģ&1 7 !— 1 e Pand if 617 != 1, we can take ¢ = G. Otherwise, let F be the closure in D 
of C(G) where C is the center of D. Then Pp = P N F contains ()7~ ! — 1 +, so Pp £ 0, and if 
Rp = F OR, then R; = R,/P; contains Cy + Pp and this is a primitive (q — 1)-st root of unity. 
The results of the previous section show that Rp contains an element ¢ such that C7~ ! = 1 and ¢ 
+ P= bo + Pr. Then Z = Zo. 

We have seen that R and P are stabilized by the inner automorphisms of D. Hence any inner 


automorphism determines an automorphism of R = R/P. In particular, if z e P — P?, then we 
have the automorphism 


(32) ñ:ā=a+P > nan’ 


of R = R/P. Since |R| = q = p”, this has the form 


t 


(33) anf, =p. 


On the other hand, if u e R — P, then w= and «aw '=a. Hence the corresponding 
automorphism in R is the identity. Since z is determined up to a multiplier in R — P by the 


condition that z € P — P?, it is clear that ñ is independent of the choice of z in P — P?. 
We shall now show that z can be chosen so that 


Ft v's 
(34) nin! = CS. 
Let G = <C>, the subgroup of D* generated by ¢. Then if mọ € P — P? and 2 e G, then 
KoARe | =} and hence 4 *toåzo ' = 1 (mod P) and 4° tot = Ro(mod P?), Then 
(35) n= ) A Ro 
AEG 
satisfies w°mu = x for u € G. Moreover, x = (q — 1)mp (mod P’) and since q1 € P,z =— mo 


(mod P?) and hence z € P— P*. Then *au = x gives mux! = u’ for u € G, so in particular we 
have (34). 

The inner automorphism a ~ mar ! stabilizes G and induces the automorphism % in R. Since 
A ~ Ā is an isomorphism of G onto the multiplicative group R*, it is clear that the order of the 
restriction of a ~ mran! to G is the order r off. Then z” commutes with every A € G and this is 
the smallest power of z with this property. 

The proof given on pp. 571—572 shows that every element of D has a representation as a 
power series in z with coefficients in K = {0} U G. Thus we can show that any non-zero 
element of D can be written in one and only one way as a Series 


(36) a= (Aot Aint hn +n" 


where the 4; € K, Ag = 0, and k > 1. It is clear from this that an element a is in the center if and 


only if it commutes with z and with every 4 € G. It follows that the center C is the set of 
elements 


(37) C= (Mot pn t+ uan + e)n 


where the u; are elements of K such that u; = u; Evidently C + P = 0 and C is a local field. The 


extension field W = C(C) is stabilized by the inner automorphism determined by z. If o is the 
induced automorphism in W/C, then C is the set of elements fixed under o. Hence W/C is cyclic 
with Galois group <o>. It is clear also that o has order r. Hence [W : C] =r. Since W is the set 
of elements 


(38) b= (Ao tdya’ + Ayn +)" 


where the 4; € R, it is clear that the ramification index of W over C is 1, so Wis an unramified 


extension of C. Comparison of (36) and (38) shows that every element of D can be written in 
one and only one way in the form 


(39) WotWint es +w,- 


where the w; e W. The multiplication in D is determined by that in W and the following 
relations for we W: 


(40) nw = o(w)n,  meC. 


Thus D is a cyclic algebra D = (W, o, x’) and [D: C] = r*. We have therefore obtained the 
following structure theorem for totally disconnected locally compact division rings. 


THEOREM 9.21. Let D be a totally disconnected locally compact division ring. Then the 
center C of D is a local field and D is a cyclic algebra D =(W, ©, y) over C where W is 
unramified and y is a generator of the maximal ideal P, of the valuation ring of C. 


Of course, this shows that D is a finite dimensional algebra over a local field. We have 
seen also that a local field is either a Laurent series field over a finite field or a finite 
dimensional extension field of a field of p-adic numbers (Theorem 9.16, p. 597). Hence D is 
either a finite dimensional division algebra over a field of formal Laurent series Fo((x)), Fo 
finite, or a finite dimensional division algebra over some p-adic field @,. The first case holds 
if and only if the characteristic of D is a prime. We have seen also that the topology is given by 
the absolute value | | defined by the unique maximal compact and open subring R of D. It is 
easily seen that this topology is the same as the product topology obtained by regarding D as a 
product of a finite number of copies of Fo((x)) or of 9,. 


It is not difficult to prove the following converse of Theorem 9.21. Let D be a finite 
dimensional division algebra over Fo((x)) or @,,. Then we can introduce a topology in D so that 
D becomes a non-discrete totally disconnected locally compact division ring. We sketch the 
argument. Let F = Fo((x)) or @,,. We have the topology on F given by an absolute value defined 
as before and the valuation ring R and the maximal ideal P (R = F[[x]] in the first case and the 
ring of p-adic integers in the second). Now R can be regarded as the inverse limit of the set of 
finite rings R/P* (see p. 73) and its topology can be identified with the topology of the inverse 
limit of finite sets. Hence R is compact and totally disconnected. It follows that F is locally 
compact totally disconnected and not discrete. The fact that the map x + x"! of F* into itself is 
continuous can be proved as in the case of the field R. 

Now let A be a finite dimensional algebra over F and endow A with the product topology. 
Then A is locally compact totally disconnected and not discrete. It is readily seen that A is a 
topological ring. Let Ma) denote the determinant of the matrix p(a) in a regular representation 
of A. Then a ~ Ma) is a continuous map of A into F, and the set U of invertible elements of A is 
the open subset defined by M(u) + 0. It is easy to see that u~ u`! is a continuous map of U into 
U. In the special case in which A = D is a division algebra over F, U = D* and A is a 
topological division ring in the sense defined at the outset. 


9.14 THE BRAUER GROUP OF A LOCAL FIELD 


We shall now apply Theorem 9.21, the remarks following it, and the results on cyclic algebras 
given in section 8.5 (p. 484) to determine the Brauer group Br(F) of a local field F. We recall 
that if E is a cyclic extension of a field F, then the subgroup Br(F, E) of the Brauer group of F 
consisting of the classes of finite dimensional central simple algebras A having E as splitting 
field is isomorphic to F*/N(E*) where M(E*) is the group of norms N;,-(a) of the non-zero 


elements a € E. The isomorphism is implemented by choosing a generator s of G = Gal E/F 
and defining the cyclic algebra (E, s, y), y € F*. Then the map y(ME*)) ~ [(E,s,y)], the 
similarity class of (£,s,y), is an isomorphism of F*/N(E*) onto Br(E/F). Let K be a subfield of 
E/F and let §: be the restriction of s to K. Then §: is a generator of the Galois group of K/F. The 
order of §: is [K : F] =r and we have n = rm where m = [E : K]. Any central simple algebra 
split by K is split by E and we have the monomorphism of Br(K/F) into Br(E/F) sending the 
class of (K,5 y) into that of (E, s, y’”) (p. 485). 

Now let F be a local field. We determine first the group F*/MW*) where W is an 
unramified extension of F. We shall need the following result. 


LEMMA. Let F, be a finite field with q elements, Fn an extension field with q” elements. 
Then any a € F*, is a norm of an element b € F*,n that is not contained in any proper 
subfield of Fn. 


Proof. The automorphism x ~ x? generates the Galois group of F’,»/F,. Hence the norm map of 
F* nis 
q 


X HAE re xt = x“ MET), 


The kernel of this map is the set of elements such that Np n rœ) = | and this has order (q”— 
1)/(q — 1). Hence the image has order q — 1. Since the image is contained in F*,, which has 
order q — 1, it is clear that the norm map of F*,» is surjective on F*,. Moreover, for any a € 
F*, there exist (g" — 1)/(q — 1) elements b such that NF p/Fq®) = a. On the other hand, the 
elements b contained in proper subfields of F,» are contained in maximal proper subfields. The 


cardinality of any of these is of the form q” where m is a maximal proper divisor of n, and 
distinct subfields have distinct orders. It follows that the number of non-zero elements 
contained in proper subfields does not exceed È(q” — 1) where the summation is taken over the 
maximum proper divisors m ofn. Evidently this number is < (q” — 1)/(q—1) =1+q+...+4q" 


~! Hence we have a b € F q” not in any proper subfield such that NF n rb) =a. 


The requirement that b is not contained in any proper subfield of F» is equivalent to F(b) 
= Fn. This occurs if and only if the degree of the minimum polynomial of b over F} is n. We 


can now prove 


PROPOSITION 9.8. Let W be an unramified extension field of the local field F, Rp the 
valuation ring of F, Pp its ideal of non-units. Then any element u € Rp- Pp is a norm in W. 


Proof. Let Ry be the residue field R,/P, and similarly let R, = R,/P,, where R,, is the 
valuation ring of W, P, its ideal of non-units. Since Wis unramified, we have [R,, : Rp] =n = 
[W: F]. By the lemma, if a = # = u + Pp, then there exists a b e R,, such that the minimum 
polynomial of b over Rp has degree n and N&..%,(0)= a, Ifo e Gal W/F, then the map é:*~6x, x € 
R,,, is in Gal Ry/R and «~¢ is an isomorphism between these Galois groups. Hence for any v 


E€ Rẹ, we have wrw) = Nawal), so if we choose v € R, such that P = b, then Nwirle)= a= a, We can 
choose a monic polynomial f(x)e R[x] of degree n such that Jœ is the minimum polynomial of 
P = b. Since deg J% = n, this is the characteristic polynomial of b (in a regular representation) 
and its constant term is (~ D'N awa, (© = (1'8. Hence we may assume that the constant term of f(x) 
is (-1)"u. Since /@ is a separable polynomial, we can apply Hensel’s lemma to conclude that 
there exists a v e R,, such that f(v) = 0. Since /@) is irreducible in Rp, f(x) is irreducible in 
F[x] and so this is the minimum polynomial over F of v. Since its degree is n, it is also the 
characteristic polynomial. Hence its constant term (—1) ”u = (—1)"N,,-(v). Then u = N,,/-(v) as 


required. L 


Since W is unramified over F, we can choose z € F such that z € P, — (P). Then any w € 
W has the form ux* where u € R,,— P,, and k e Z. Then Nw") = Nixie” and Nyu) € Rp- 
Pp. Conversely, if v = u*" where u € Rp- Pp, then 2” = Ny Azx*) and Proposition 9.8 shows 
that u is a norm. Hence v is a norm. It is clear from these results that F*/M(W*) is a cyclic 
group of order n with generator zN(W*). Then Br(W/F) is a cyclic group of order n. We can 
obtain an isomorphism between F*/M(W*) and Br(W/F) by mapping the coset z*N(W*), 0 < k 
<n-—1, onto the class of central simple algebras over F determined by the cyclic algebra (W, 
o, 2) where o is the Frobenius automorphism of W/F. 


We can combine this result with the results of the previous section to obtain a determination 
of Br(F), namely, we have 


THEOREM 9.22 (Hasse). The Brauer group of a local field is isomorphic to the additive 
group of rational numbers modulo 1 (that is, Q/2). 


Proof. We have seen that any finite dimensional central division algebra D over the local 
field F is a totally disconnected locally compact division ring. Hence Theorem 9.21 shows that 
D has an unramified (hence cyclic) extension field W/F as splitting field. It follows that any 
class [A] in Br(F) is contained in Br(W/F) for some W. Then A ~ (W, o, z*) where o is the 
Frobenius automorphism and 0 <k<n=[W: F]. Once W has been chosen, then k is uniquely 


determined. We now map [A] into the rational number r = k/n. We wish to show that the 
rational number thus determined is independent of the choice of the splitting field. It is readily 
seen by using the results of section 9.12 that for any positive integer n there exists a unique (up 
to isomorphism) unramified extension W/F with [W : F] =n. Moreover, if W’/F is unramified 
and [W : F] = m, then W' c W is isomorphic to a subfield of W if and only if m|n. It follows 
that it suffices to show that if W c W is a splitting field for A, then the rational number 
determined by W is the same as that determined by W. Now the restriction ë of the Frobenius 
automorphism o of W/F is the Frobenius automorphism of W'/F. Hence A ~ (W’, ë , n’) where 0 
< Į < m and so the rational number determined by A and W’ is l/m. Since (W’, ë, n) ~ (W, a, 
n”) = (W, o, 2), we have In/m = k and k/n = l/m. It is clear also that our map is surjective 
on rational numbers satisfying 0 < r < 1. For r= k/n with O < k <n and if we take W to be the 
unramified extension of degree n over F, then the cyclic algebra (W, o, x“) is central simple 
with W as splitting field and this maps into r = k/n. If we have two central simple algebras A 
and B over F, we can choose an unramified extension field W that is a splitting field for both. 
Then A ~ (W, o, z*) and B ~ (W, o, n^) where 0 < k, 1 <n, and A ® B ~ (W, o, 7% + Ù ~ (W, o, 
n”) where 0 < m < n and m/n = (k + 1)/n (mod Z). It follows that the map {A} ~ (k/n)+ Z is an 


isomorphism of Br(F) onto 9/Z. 


Another important consequence of our results is 


THEOREM 9.23. The exponent of a finite dimensional central simple algebra over a local 
field coincides with its index. 


Proof. We have to show that if D is a central division algebra over a local field F such that 
[D : F] =n’, then the order of {D} is n. By Theorem 9.21, D = (W, t, x) where qt is a generator 
of the Galois group of W/F and t € Pp — P,*. Then the exponent of D is the order of zN(W*). 


This is evidently n. E 


9.15 QUADRATIC FORMS OVER LOCAL FIELDS 


We shall first define an invariant, the Hasse invariant, of a non-degenerate quadratic form on a 
finite dimensional vector space over an arbitrary field F of characteristic # 2. In this we 
follow a method due to Witt that appeared in a beautiful paper of his on quadratic forms in vol. 
176 (1937) of Crelle’s Journal. 

The definition and properties of the Hasse invariant are based on quaternion algebras and 
Clifford algebras. We need to recall some results on quaternion algebras and develop some 
formulas for tensor products of these algebras. We have denoted the quaternion algebra 
generated by two elements i, 7 satisfying the relations 


(41) P= fed i=j 


where a and b are non-zero elements of F, as (a, b) (p. 232). In dealing with tensor products of 
quaternion algebras we abbreviate (a, b) ® p (c, d) to (a, b)(c, d) and as usual we write ~ for 


similarity of central simple algebras. Evidently we have 

(i (a, b) = (b, a). 
It is clear also that 

(ii) (a, b) = (as?, bt?) 


for any s #0, t #0. 
A quaternion algebra (a,b) is either a division algebra or (a, b) = M,(F), that is, (a, b) ~ 1 
in the Brauer group Br(F). Evidently (1, b) ~ 1 and hence 


(i11) (a’, b) ~ 1. 


The algebra (a, b) has the base (1, i, j, k = ij). Ifx = xo + xi + xj + x3k and 


(42) T(x)= 2x9, N(x) = x9? —ax,?—bx2? + abx3?, 
then x? — T(x)x + Mx) = 0. Let (a, b)y denote the subspace of elements of trace 0: T(x) = 0. 
This has the base (i,j,k) and has the quadratic norm form M(x) = — axı? — bx? + abx;7. It is 


clear that (a,b)~ 1 if and only if (a, b) contains an element z 4 0 such that z? = 0. This is the 
case if and only if T(z) = 0 = Mz). Hence (a, b) ~ 1 if and only if the quadratic norm form on 
(a, b)q is a null form, that is, — ax, ? — bx? + abx? = 0 has a solution = (0, 0,0). Evidently this 
implies 

(iv) (a,—a) ~ 1. 


Since (a, b), can be characterized as the set of elements x € (a, b) such that x ¢ F but x? € 
F, it is clear that an isomorphism of (a, b) onto (c, d) maps (a, b)g onto (c, d)o. It follows that if 
(a, b) = (c,d), then the quadratic forms — ax, * — bx? + abx? and — cx? — dx + cdx are 
equivalent. It is quite easy to apply the theory of composition algebras to prove the converse 
(see exercise 2, page 450 of BAI). 

Ifa is a non-square, then we have the field 4= Fiva, which has the automorphism o such 


that "(va = -v@, Then the quaternion algebra (a, b) is the same thing as the cyclic algebra (Z, 
o, b) (p. 480). Hence the multiplication formula for cyclic algebras (p. 475) gives the formula 


(v) (a, b)(a, c) ~ (a, be) 


if a is a non-square. Evidently this holds also if a is a square, since in this case all three 
algebras are ~1. Since (a, b)(a, b) ~ 1, (a, a).(b, b) ~ (a, a)(a, b)(b, b)(a, b) ~ (a, ab)(b, ab) ~ 
(ab, ab). Hence we have 


(vi) (a, a)(b, b) ~ (ab, ab). 


Iteration of this gives 
(vi’) (a1, a1)(@, A2)*** (ar, Gp) ~ (Ay °° ap, Ay *** Ay). 


We now consider a quadratic form Q on an n-dimensional vector space V over a field F of 
characteristic 2. The associated symmetric bilinear form B is defined by B(x, y) = Q(x + y) — 
Q(x) — O(y). Then Q(x) = 4B(x, x). We assume throughout that O is non-degenerate in the sense 
that B is nondegenerate. We now define the discriminant of O to be the discriminant of 4 B. 
Thus if (vj,..., v) is an orthogonal base of V relative to Q (= relative to B) then the discriminant 
d= d(O) defined by this base is [Ti €B: ») = [Ti Qeo. 

We shall now define the Hasse invariant of QO as a certain element of the Brauer group 
Br(F). If = 1, the element is the unit 1 of Br(F) and ifn > 1, we define the Hasse invariant of 
Q relative to an orthogonal base (v),...,V,) as the element of Br(/’) determined by the tensor 


product 
(43) | | (Q(vi), Q(v;)). 
i<j 
We proceed to show that this is independent of the choice of the orthogonal base. 


PROPOSITION 9.9. Let (v... ,v,),(v"}, ---V" n) be orthogonal bases of V relative to Q. Then 


(44) [| (Q0), Qe) ~ || (QW, 00). 
i<j i<j 
Proof (Witt). Let U be an n-dimensional vector space equipped with a quadratic form P for 


which we have an orthogonal base (u, ...,u,,) such that P(U) =— 1, 1 <i <n Form W=U® V 


and define a quadratic form R on W by R(u + v) = P(u) + OW”), u e U, e V. ThenW=U , V 
and the restrictions of R to U and Vare P and Q respectively. We shall show that for any 
orthogonal base (v ,..., v„) for V we have 


(45) [| Qe), Qlv;)) ~ C(W, R) @r(d, d) 
i<j 


where C(W, R) is the Clifford algebra of R and d is a discriminant of Q. Evidently this will 
imply (44). Put 4% = Ql), di = []i a} so da =d, We show first that 


n 


(46) C(W, R) = I aj, d 


We know that C(W, R) is a central simple algebra generated by the elements u; v; 1 =<i <n, 
and we have the relations u;7 =— 1, v? =a, UU =-U,U,, vv, = — VV Ati Fj, uwe = — VU; 


for all i, k (pp. 229-230). Put 


"7 j, —_ eee 5 ) 
(47) "i sS (viu) (Vn- 1Un-1)Un, W2 = Vrn. 
Since (viu)? = — vt u? = aj, (viu)(vju;) = (cjut) Va(Vit;) = (DU). ifi<n and V,W2 = — Wz Up, We 
have 
J 
(48) Wer=d=dy W? =a, WW. = — WW]. 


Hence the subalgebra generated by W, and w, is (a,,d,,) and C(W, R) = (a,,d,,)®@ -C' where C' 
is the centralizer in C(W, R) of the subalgebra generated by w, and w, (p. 233). The elements 
u; v; 1 <i<n-—l, commute with w, and w, and the subalgebra generated by these elements is 
isomorphic to the Clifford algebra C(W' R' ) where W's Èi 'Fu;+ X1 ' Fo, and R' is the 
restriction of R to W’. Since [COV R': F] =2 2?” À and [C': F] 22"/4 = 2?” - D, we have C = 
C(W' R' ) and C(W, R) = (a,, d,,) ® C(W' R' ). The formula (46) now follows by induction on n. 
This and (v) and (vi' ) give 


(49) C(W, R) ~ | | (a, a;) ~ (d,d) | | (ai, aj). 


i<j i<j 
Hence [li<; (ai a;) ~ C(W, R)@ (d,d). O 


In view of Proposition 9.9 it makes sense to define the Hasse invariant s(Q) of Q to be the 
unit of Br(F) ifn = 1 and the element of Br(F) defined by (43) if> 1. 

The Hasse invariant is either 1 or an element of order two in the Brauer group. If F is 
algebraically closed or is finite, then Br(F) = 1, so in this case the Hasse invariant is trivial for 
any quadratic form over F. If F =R or a local field, then there is a unique element of order two 
in Br(F). We denote this as — 1. Let O* and QO be positive definite and negative definite 
quadratic forms respectively on an n-dimensional vector space over R. Then the Hasse 
invariant (Q*)=(+1, £1)" 0? =(+1)"""?. If n=0 (mod 4), then s(Q*) = s(Q-), but these 
forms are inequivalent. On the other hand, we shall show that the discriminant and Hasse 
invariant constitute a complete set of invariants for quadratic forms over local fields: Two 
such forms are equivalent if and only if they have the same discriminant and the same Hasse 
invariant. 

We develop first some results for arbitrary base fields (of characteristic # 2). 


PROPOSITION 9.10. Let n < 3. Then two non-degenerate quadratic forms on an n- 


dimensional vector space are equivalent if and only if they have the same discriminant and 
Hasse invariant. 


Proof. The necessity of the condition is clear and the sufficiency is clear if n = 1. Now let = 
2 and let diag {a, b}, diag {a’, b' } be diagonal matrices for the two quadratic forms. We are 
assuming that (a, b) ~ (a' b' ) and ab and a'b' differ by a square. Then (a, b) = (a’, b') and we 
may assume that ab = a'b'. The condition (a,b) = (a' b' ) implies that the quadratic forms 
axı? +bx:?- abxs? and 81° +b'x2" -dbx are equivalent. Since ab = a' b' we have the 
equivalence of @1*+6x2* and a’x,*+b'x2* by Witt’s cancellation theorem (BAI, p. 367). Next let 
n = 3 and assume that 2= axı +bx2? +ex3", Q= g'x,? +b’x2*+cx3*,_ The hypotheses are 
(a, ba, eb, ¢) ~ (a’, ba’, cb’, c) and d = abc and d'= a' b' c' differ by a square, so we may assume 
that d = d'. It suffices to show that — dQ and — dQ' which have discriminant — 1, are equivalent. 
A simple calculation, which we leave as an exercise, shows that 
s(—dQ) ~ s(Q)@(—d, —d). Hence s(—dQ) ~ s(— dQ’), so it suffices to prove the result for Q and Q' of 
discriminant — 1. Then we may assume that 2 = @x1°+5x2*—abx3*, Q' = a'xy? + b'x2?—a'b'xs?_ Then 
s(Q) = | b)(a, — ab)(b, — ab) ~ (a, b)(ab, — ab) ~ (a, b). Hence (a, b) = (a’, b’). Since QO arid Q’ are the 
negatives of the norm forms on (a, b), and (a', b’ )» respectively, it follows that Q and Q’ are 


equivalent. LI] 


We prove next 


PROPOSITION 9.11. Let F be a field such that every quadratic form on a five- 
dimensional vector space over F is a null form. Then any two non-degenerate quadratic 
forms on a vector space V over F are equivalent if and only if they have the same 
discriminant and the same Hasse invariant. 


Proof. The necessity of the condition is clear and the sufficiency holds by Proposition 9.10 if 
dim V < 3. Hence assume n >4. The hypothesis implies that any non-degenerate quadratic form 
P on a four-dimensional vector space U/F is universal, that is, represents every non-zero 
element of F. For if a + 0 we can form U ® Fx, x + 0, and define a quadratic form R on U ® 
Fx by Riu+ax) = P(u)—2’a for ue U, «eF, The fact that R is a null form implies that we have a u + 
a x #0 such that P(u) = 07a. If a = 0 then u £ 0, so P is a null form and hence P is universal. If 
a # 0 then P(o ! u) = a. Thus P is universal. The universality of non-degenerate quadratic 
form on four-dimensional spaces implies that if Q is a non-degenerate quadratic form on an n- 
dimensional vector space V, n > 4, then we have an orthogonal base (v,...,v,) with O(v;) = 1 


for i > 3. If R denotes the restriction of Q to Fv, + Fv, + Fv}, then the definitions and the 
formula (l,a) ~ 1 show that O and R have the same discriminant and Hasse invariant. If Q' is a 
second non-degenerate quadratic form on an ,-dimensional vector space, then we have an 
orthogonal base (v',...,v’,,) with O(v’ ,) = 1 for i > 3. The conditions that s(Q) = s(Q’ ) and Q 
and QO’ have the same discriminant imply the same conditions on the restrictions of Q and Q’ to 
Fy, + Fv, + Fv; and Fv’, + Fv’, + py 3. Hence these restrictions are equivalent and so Q and 


QO’ are equivalent. E 


We require one further result for general fields. 


PROPOSITION 9.12. The quadratic form Q = axı? +a2X2° +a3x3° +a4x4° with d = ajaza3a4. £ 
0 is a null form if and only if PV 4) is a splitting field for ( — a3a4, — aa4). 


Proof. Put a = — aza, b = — aa, © = Gya;a4. Then cQ is equivalent to 
dx? —ax2?—bx3* +abx4?, and —ax,”—bxs’ +abx4? is the norm form on (a, b)ọ = ( — a344, — aza4)o. 
Suppose first that V4" 8° #(V4)=F Th this case cQ is equivalent to ¥1?—ax2?— bx, +abx4?, the 
norm form of (a,b), and (a, b) ~ 1 if and only if this norm form and hence Q is a null form. 


Thus the result holds in this case. Next assume v 4¢F Then FlV4) is a splitting field of (a,b) if 
and only if F(\/4) is a subfield of (a,b) (p. 221). The condition for this is that (a, b)ọ contains 


an element u such that u? = d. This is the case if and only if cQ and hence Q is a null form. 


Hence the result holds in this case also. 


We now suppose that F is a local field. Then the results of the previous section show that 
there is a unique element of order two in Br(F). This has a representative that is a cyclic 
algebra (W, o, m) where W is an unramified quadratic extension of F, o the automorphism # 1 
of WIF, and z is any element of F such that z € P — P? where P is the ideal of non-units in the 


valuation ring R of W. Since we are assuming that char F 4 2, W= Fv and hence (W, ©, z) = 
(a, n). We have 


PROPOSITION 9.13. Let F be a local field of characteristic + 2 and let A be a quaternion 
division algebra over F Then any quadratic extension field E/F is a splitting field for A. 


Proof. We have A = (W,o, z) = (a, z). The extension field E/F is either unramified or 
completely ramified. In the first case E = W, so A contains a subfield isomorphic to E and 
hence Æ is a splitting field. Next assume that E is completely ramified. Then E = F(b) where b 
is a root of a quadratic Eisenstein polynomial. By completing the square we may assume that 
b*=n' e P—P*. We can construct the division algebra (W’ ,o' ,z' ) where W’ is unramified and 
o’ is an automorphism of period two. Then (W’, 0’, z') = (W, ©, z) = A, so again A contains a 


subfield isomorphic to E and Æ is a splitting field. 


The next result we shall need on quaternion algebras requires the stronger hypothesis that 
the residue field R/P is of characteristic + 2. This is 


PROPOSITION 9.14. Jf F is a local field such that char R/P # 2, then (—1, — 1 F~ 1. 


(This does not always hold if char R/p = 2. For example ( — 1, — 1)/@, ~ 1. See exercise 5 


below.) 


Proof. The result is clear if— 1 is a square in F. Hence we assume W = Fi—1) P F Now W is 


unramified since the reducibility of x? + 1 in (R/P)[x] implies by Hensel’s lemma the existence 
of ¥-1 in F. Thus [(R,,/P,,):(R/P] = 2 so the residue degree of W/F is two and hence the 


ramification index is 1, that is y/F' is unramified. It follows that if (— 1,— 1) ~ 1 then (— 1, — 1) 
= (— lz) where r e P, —P,,”. This implies that — m e MW*). This contradicts the 


determination of M(W*) given on p. 610. Hence (— 1, — 1)~1. E 
We can now prove 


PROPOSITION 9.15. Jf F is a local field of characteristic + 2, then any non-degenerate 
quadratic form QO on a five-dimensional vector space over F is a null form. 


Proof. We may assume that 2 = Li ax? and if multiply O by [] a; we may assume that JJa; is a 
l l 


square. Suppose that Q is not a null form. Then )*, a,x is not a null form. By Proposition 


9.12 F (V 41424344) is not a splitting field for (— a3a4, — aya4). Then (— a3a4, — aya4)~ 1, so by 
Proposition 9.13, ajaa} a4 is a square. Hence a; is a square. Similarly every a; is a square 


and hence we may assume that Q=Lix’.Then(—1,-1!)~1!. by Proposition again. This 
contradicts Proposition 9.14 if the characteristic of the residue field of F # 2. Now suppose 
this is 2. Then F contains the field @, of 2-adic numbers and @, contains V—7. To see this we 


2 


note that, by Hensel’s lemma, x^ + x + 2 is reducible in @l,. Hence 9, contains 


i(—1 + J=7) and Q3 contains VZT. Then 1? + 1? + 1? + 2? + (V—7)? = 0 and Ff x? is a null form in 
Q, and hence in F. This completes the proof. E 
By Proposition 9.11 and 9, 15 we have 


THEOREM 9.24. IfF is a local field of characteristic + 2, then any two non-degenerate 
quadratic forms on an n-dimensional vector space VIF are equivalent if and only if they 
have the same discriminant and Hasse invariant. 


We show next that if n > 3, the two invariants are independent. The proof of Proposition 
9.11 shows that it suffices to prove this for n = 3. Then a calculation indicated in the proof of 
Proposition 9.10 shows that if Q = ax ? + bx,* + cx,*, then s(Q) ~ ( — da, —db)( — x, —d) where 
d = abc. It is clear from this formula that for a given d, a and b can be chosen so that s(Q) = 
+1. Hence a, b, c can be chosen so that the discriminant is any d and s(Q) = + 1. This result 
and Theorem 9.24 imply that the number of equivalence classes of non-degenerate quadratic 
forms over F with n > 3 is 2|F*/F*?| where F*? is the subgroup of squares in F*. It is easy to 
see, using an argument based on Hensel’s lemma as in the proof of Proposition 9.8 on norms, 
that if the characteristic of the residue field is = 2, then |F*/F**| = 4. Accordingly, the number 


of equivalence classes of non-degenerate quadratic forms for a given n > 3 is 8. Some 
information on the case n < 3 and the case in which the residue class has characteristic two is 
indicated in the exercises. 


— 


EXERCISES 


. Show that ax,? + bx’, ab + 0, is a null form if and only if ab =—d? and that ax}? + bx? + 


cx," is a null form if and only if s(Q) ~ ( — d, — d), d = abc. 


. Let F be a finite dimensional extension of @,. Show that |F*/F*?| = 21F Q2, 


. Determine the number of equivalence classes of non-degenerate quadratic forms with n = 


2 over a local field. 


. Let F be a field such that (a, b) ~ 1 for every quaternion algebra over F. Show that two 


non-degenerate quadratic forms on an n-dimensional space over F are equivalent if and 
only if they have the same discriminant. 


. Show that the quadratic form x; + x3% + x37 on a three-dimensional vector space over 9, 


is not a null form. Hence conclude that (— 1, — 1)@, + 1. 
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Dedekind Domains 


In this chapter we shall study the domains in which proper non-zero ideals can be factored in 
one and only one way as products of prime ideals. The most notable examples are the rings of 
Z-integral elements of number fields, that is, finite dimensional extension fields of the field @ 
(see BAI, pp. 278281), These are the objects of study of algebraic number theory. Another 
important class of examples are the rings that occur in the study of algebraic curves. Here we 
begin with a field F(x,y) where F is a base field (usually algebraically closed), x is 
transcendental over F, and y is algebraic over F(x). Then the subring of elements that are 
integral over F[x] has the factorization property stated above. 

There are many equivalent ways of defining the class of domains, called Dedekind 
domains, in which the fundamental factorization theorem for ideals into prime ideals holds (see 
section 10.2). We shall take as our point of departure a definition based on the concepts of 
fractional ideals and of invertibility. The latter is equivalent to projectivity as a module for the 
given ring. 

The result that the domains mentioned above are Dedekind can be deduced from a general 
theorem stating that if D is Dedekind with field of fractions F and F is a finite dimensional 
extension field of F, then the subring D’ of D-integral elements of E is a Dedekind domain. 
This is proved in section 10.3. In sections 10.4 and 10.5 we consider the central problem of 
studying the factorization in D' of extensions of prime ideals of D. This is closely related to the 
problem of extension of valuations from F to E that was considered in chapter 9. 

Besides the study of ideal theory, we consider the structure of finitely generated modules 
over Dedekind domains. The special case of torsion-free modules is a classical one that was 
first treated by E. Steinitz (see section 10.6). 

Finally, we consider the class group of a Dedekind domain as defined in section 7.9 and we 
show in section 10.6 that this group has a concrete realization as a classical group defined by 
the fractional ideals. 


10.1 FRACTIONAL IDEALS. DEDEKIND DOMAINS 


Let D be a domain, F its field of fractions. It is useful to extend the concept of an ideal in D to 
certain submodules of F given in the following 


DEFINITION 10.1. Jf D is a domain and F is its field of fractions, a (D —) fractional ideal 
I is a non-zero D-submodule of F such that there exists a non-zero a in D such that al c 
Door, equivalently, al c I N D). 


The fractional ideals contained in D are the non-zero ideals of D. These are called integral 
ideals. If I is a fractional ideal and a is a non-zero element of D such that al c D, then a/ is an 
integral ideal. If b is any non-zero element of F, then Db = {db\d € D} is a fractional ideal 
since it is clearly a D-submodule of F and if b = ac” !, a, c € D, then c(Db) c D. A fractional 
ideal of the form Db is called a principal ideal. 

If J, and /, are fractional ideals, then so is the module sum /, + J, since this is a submodule, 
and if a;l; C D for a; + 0 in D, then a,a,(J, + I,) CD. The intersection J, N J, is a fractional 
ideal also since this is a D-submodule and J, N J, £ 0 since if b; + 0 is in J; we have an a; £ 0 
in D such that a;b; E D N Z. Then (a, b;)(az b2) £ 0 is in J, N 15. Moreover, if a + 0 satisfies 
al, c D, then a(l N D) c D. We define J; J, = {© by; bo)b1; € Ji, bo; € 15}. Evidently this is a 
D-submodule of F and the argument used for J; N J, shows that J, J, is a fractional ideal. 


We shall need to know what the homomorphisms of a fractional ideal into D look like. This 
information is given in 


PROPOSITION 10.1. Let f be a D-homomorphism of the fractional ideal I into D and let b 
be any non-zero element of I. Then f has the form 


(1) a~>b~'f(b)a, 
acel. 
Proof. Choose a d + 0 in D such that d7 c D. Then db € D and da € D for any a e I. We have 
daf (b) = f (dab) = dbf (a). 
Hence af(b) = bf(a) and so fa) = b~! Kb)a. Thus fhas the form given in (1). E 
If J is a fractional ideal, we define 
(2) I~! = {ce F|cI c D} 


It is clear that Z ~lis a D-submodule of F. Since there exist a £ 0 in D such that al c D, it is 
clear that 7~ ! 4 0. Moreover, if b 4 0 is in Z N D, then Z7 lb c D. Hence Z7 ! is a fractional 


ideal. If J is the principal ideal Db, b + 0, then it is clear that 7! = Db !. It is clear also that /, 
cl, =]! S15 . 


If J is a fractional ideal, then 7 ! Z c D so T H is an integral ideal. We shall now call J 
invertible if 


(3) I] =D. 


IfI = bD +£ 0, then J~ ! = b ~ID. Evidently Z ~" = (b ~ '!D)(bD) = D. Hence every principal 
ideal is invertible. The fractional ideals constitute a commutative monoid under multiplication 
with D as the unit. If Z is invertible, then Z7! = D so Z7 ! is the inverse of Z in the monoid of 
fractional ideals. Conversely, let Z have the inverse J in this monoid. Then JI = D implies that J 
c I~! as defined by (2). Since Z7! J c D, we have J~ U= D. Since the inverse of an element 
of a monoid is unique, we have J~ ! = J. Thus the invertible ideals constitute the group of units 
of the monoid of fractional ideals. We prove next 


PROPOSITION 10.2. Any invertible ideal is finitely generated. 


Proof. Let Ibe invertible. Then J~ 'J= D implies the existence of elements c, e J~!, bje 1 
such that }) "| c;b; = 1. Now let b e J. Then b = bl = }'(bc,)b; and a; = bc; € D. Thus [= } Db; 


and (b; ,..., 5,,) is a set of generators for Z. 


Next we establish the equivalence of the conditions of invertibility and projectivity for 
fractional ideals: 


PROPOSITION 10.3. 4 fractional ideal I is invertible if and only if it is projective. 


Proof. First suppose that J is invertible. Then 1 = $ ™, cb, c; € I7 |, b; € I. The map f; : a ~ 
ac; a € I, is in homp(/,D) and a = } (acb; = )f(a)b;. Hence J is D-projectiveby the “dual 
basis lemma” (Proposition 3.11, p. 152). Conversely, suppose that Z is projective. Then we 
have a set of generators {b,,} of Z and corresponding maps f, € hom p(/, D) such that for any a 
€ I, f,(a) = 0 for all but a finite number of a and a=» f,(a)b,. We have shown in Proposition 
10.1 that if b is a non-zero element of 7, then f, has the forma ~ b “| f,(b)a. Since f(b) = 0 for 
all a except, say, a = 1, 2, ..., m, we have f, = 0 fora #1,2,...,.manda=)>™, b -l £(b)ab,. 
Then 1 = ¥ b~! f(b)b; and since b | f(b)a € D for alla e I, c;= b7! f(b)e J |. Thus 1 => 


cib, c;€ I7}, b; € I, and hence J ~'J = D. Thus / is invertible. L 


We say that a fractional ideal / is a divisor of the fractional ideal J if there exists an integral 
ideal K such that J = IK. Since K c D and DI c J, this implies that J c 7. If J is invertible we 


have the important fact that the converse holds: If Z > J, then Z is a divisor of J. For, D=1~ J 


>I7'J,so k=17 is integral. Moreover, J = (I7 'J) = IK. 
We are interested in existence and uniqueness of factorization of ideals into prime ideals. 
For invertible integral ideals we have the following uniqueness property. 


PROPOSITION 10.4. Let I be integral and invertible and suppose that I = P} P} ... Pp 


where the P; are prime ideals of D. Then this is the only factorization of I as a product of 
prime ideals. 


Proof. We remark first that if M is a commutative monoid and a is a unit in M, then any factor 
of a is a unit. Applying this remark to the multiplicative monoid of fractional ideals we see that 
the P ; are invertible. Now let /= Q; Q, ... Q, where the Q, are prime ideals. Then P; > Q} ... 


Q, and since P} is prime, we may assume that P; > Q}. Since P} is invertible, we have Q, = 
P, R, where R} is integral. Then R; D Q}. Since Q} is prime, either P} c Q; or R; c Q}, so 
either P; = Q} or R; = Q4. In the latter case R} = P, R} and since 7 = Q,Q, ... Q, = P1 Ri Q2... 
Q,,, R, is invertible. Then D = R; | R} = R} ~! R} P4 =P), contrary to the hypothesis that P} is 
prime (hence proper). Thus Q = P}, so we have P} P, ... Pa =P, Q2 ... Q, and multiplication 
by P! gives D=Q,... Q, ifm=1 and P; ... P,, = Q» ... Q, if m > 1. In the first case we 
have n = 1 also, and in the second n > 1 and induction on m can be applied since P, ... Pn is 


invertible. LJ 


We shall now define the class of rings that will concern us in the chapter. 


DEFINITION 10.2. A domain D is called a Dedekind domain if every D-fractional ideal of 
F (the field of fractions of D) is invertible. 


By Proposition 10.3, this is equivalent to assuming that every fractional ideal is projective. 
It is clear from Proposition 10.2 that any Dedekind domain is noetherian. We have also the 
following important property: 


PROPOSITION 10.5. Any prime ideal + 0 in a Dedekind domain is maximal. 


Proof. Let be a non-zero prime ideal ( = integral prime ideal) of the Dedekind domain D. If 
I is not maximal, we have an ideal J of D such that D ? J? I. Then I = JK where K is integral. 
Since J# I, we have 1? J. Also I Ż K since KK” '=DK>JK=I, so I > K implies J = K and 
K = JK. Then D = KK '= JKK" ! = J, contrary to D # J. Thus we have J = JK with J ¢ I and 


K £ I. This contradicts the assumption that J is prime. 


We have the following fundamental factorization theorem for Dedekind domains: 


THEOREM 10.1. Every proper integral ideal of a Dedekind domain can be written in one 
and only one way as a product of prime ideals. 


Proof. In view of Proposition 10.4, all we have to do is prove that if J is a proper integral 
ideal in the Dedekind domain D then / is a product of prime ideals # 0. Suppose that this is not 
the case, so the set of proper integral ideals that are not products of prime ideals is not 
vacuous. Then by the noetherian property of D this set contains a maximal element /. Then Z is 
not prime and hence Z is not maximal. Then there exists an ideal J, in D such that D ? J, ? Z. 


Then J = /, J, where J, is a proper integral ideal. We have J > Z, and /, = J implies that D = Z 
T=], 1L1,~!=],. Thus D? I? J for i = 1, 2. By the maximality of J, I; is a product of prime 


ideals. This gives the contradiction that / = J} J, is a product of prime ideals. L 


An immediate consequence of this result is 


COROLLARY 1. Suppose that I = P, P, ... P, where the P, are primes. Then the integral 
ideals + D containing I have the form P; Pz ... P;, where 1 Si, <iy<...<i,<m. 


The proof is clear. 
It is clear also that we have the following consequence of Corollary 1. 


COROLLARY 2. /f/ is an integral ideal in a Dedekind domain D, then D/I is artinian. 


It is evident that the fractional ideals of a Dedekind domain constitute a group under 
multiplication with D as the unit and J~ ! as defined by (2) as the inverse of Z. The fundamental 
factorization theorem gives the structure of this group: 


THEOREM 10.2. The group of fractional ideals of a Dedekind domain is a direct product 
of the cyclic subgroups generated by the prime ideals. These are infinite. 


Proof. Let/ be a fractional ideal + D and let a be a non-zero element of D such that al c D. 
Then a/ is an integral ideal, so either aJ = D or this is a product of prime ideals. Also aD is an 
integral ideal, so either aD = D or aD is a product of prime ideals. Since /(aD) = al, I = (al) 
(aD) !. Then it is clear that we can write J = P;*1 P,*2 ... P fr where the P, are prime ideals 
and the k; are non-zero integers. It follows from Theorem 10.1 that, conversely, if the k; are 
non-zero integers and the P; are prime ideals, then P,“! P,'2 ... P.‘r + D. This implies that the 
representation of a fractional ideal # D as a product P,"! P,'2 ... P.*r is unique. Hence the 
group of fractional ideals is the direct product of the cyclic subgroups generated by the primes. 


It is clear also that these are infinite groups. 


EXERCISES 


1. Show that any finitely generated D-submodule of F + 0 is a fractional ideal. 


An BW N 


7. 


. Show that if D is Dedekind, then the only fractional ideal / satisfying 7? = I is J = D. 
. Use Theorem 10.1 to prove that any p.i.d. is factorial. 
. Give an example ofa factorial domain that is not Dedekind. 


. Let D=2[s -3], the subring of F = Q(x -3) of elements a + by -3 a, be Z. Let I be the 


fractional D-ideal D + Dw where w =— 4 + 4 V ~3(a primitive cube root of 1). Compute 
I~! and J” |. Is J invertible? 


. Let D = 2 [V5] with quotient field F = a[V~-5]. Show that J = D3 + DV-5 is a 


projective D-module that is not free. 


Show that any Dedekind domain that is factorial is a p.i.d. 


In the remaining exercises, D is a Dedekind domain. 


8. 


9. 


10. 


11. 


12. 


13. 


14. 


15. 


Let J, and J, be D-integral ideals and write “=i? where P, are distinct prime 
ideals and the e,, > 0. Show that J, + J, = pymmen 20 |, p miei, e.) and I, N h = 


PPP ear eg 


Show that the lattice of D-integral ideals is distributive (cf. BAI, p. 463). 


(Chinese remainder theorem for Dedekind domains.) Let J). ..., Z, be integral ideals ina 
Dedekind domain D, a, ... ,a,, elements of D. Show that the system of congruences x = aj 
(mod J;), 1 <j < n, has a solution x = a in D if and only if for any J, k, 1 <j, k <n, we 
have a; = a, (mod J; + I;,). (Hint: The necessity of the conditions is clear. The 
sufficiency is proved by induction on n. For n = 2 we have a, — a) = b; — bp, bj € J}. 
Then a = a, — b; = ay — by satisfies a = a,(mod J;), j = 1, 2. For the inductive step one 
uses the distributive law j -=A "= 1 4; + [,. See exercise 9 and p. 461f of BAL) 
Let J be a fractional ideal, J an integral ideal. Show that there exists an element a € 7 
such that 7 | a + J= D. (Hint: Let P}, ..., P, be the prime ideals dividing J. For 1 < i < 
r, choose a; €e IP, ... P,P; ~!, é IP, ... P, and puta =} "a; Then P;? al !,1<i<r, 
and al~!+J=D.) 

Let J be a fractional ideal and a any non-zero element of 7. Show that there exists a b €e I 
such that J = Da + Db. (Hint: Apply exercise 10 to the given and J= al !.) 


Let J be a fractional ideal, J an integral ideal. Show that //Z/J and D/J are isomorphic D- 
modules. 


Let P be a prime ideal + 0 in D. Show that if e > 1 then D/P* > P/P > ... D> P€- I/Pe > 
0 is a composition series for D/P® as D-module. Show that P’/P'*!, i > 0, is a one- 
dimensional vector space over the field D/P. 


Show that if D has only a finite number of ideals P, ..., Ps, then D is a p.i.d. (Hint: Let I 


= P,°1 ... P.e,, where the e; > 0. For each i choose a; € IP, ... P,P; ',¢P£*!. Puta= 


>’; a; Thena e P£, ¢ Pet! and I= Da.) 


10.2 CHARACTERIZATIONS OF DEDEKIND DOMAINS 


We shall give a number of characterizations of Dedekind domains. The first one involves the 
important concept of integral closedness that we introduced in section 7.7. 


It is easy to see that any factorial domain D is integrally closed (exercise 1, below). We 
now prove 


PROPOSITION 10.6. Any Dedekind domain is integrally closed. 


Proof. Let D be a Dedekind domain, F its field of fractions, and let u be an element of F that is 
D-integral. Let f(x) =x" + a,x"~!+... +a, be a monic polynomial with coefficients in D such 
that u) = 0 and put M = D 1 + Du + ... + Du” ~ !. Ifu=be~! where b, c € D, thence” |Mc 
D, so M is a fractional ideal. Evidently M? = M, which implies M = D (exercise 2, p. 604). 


Then u € M = D and hence D is integrally closed. 


We have now established the following three properties of Dedekind domains D: 
(1) D is noetherian. 
(2) Every non-zero prime ideal in D is maximal. 
(3) D is integrally closed. 


We proceed to show that these properties characterize Dedekind domains. The proof is based 
on some lemmas that are of independent interest. 


LEMMA 1. Any fractional ideal for a noetherian domain D is finitely generated as D- 
module. 


Proof. IfI is a fractional D-ideal, we have an a £ 0 in D such that Z ' = al c D. Then J" is an 
ideal in D and since D is noetherian, Z ' is a finitely generated D-module. Then J = a 7 1 ' is 


finitely generated. 


LEMMA 2. Any non-zero ideal of a noetherian domain contains a product of non-zero 
prime ideals. 


Proof. As in the proof of Theorem 10.1, if the result is false we have an ideal / + 0 maximal 
relative to the property that J does not contain a product of non-zero prime ideals. Then / is not 
prime and hence there exists ideals J;, j = 1, 2, such that J, ? I and J, I, c I. By the maximality 


of /, J; contains a product of non-zero prime ideals. Since [> J, Jh, this gives a contradiction. 


E 


LEMMA 3. Let D be an integrally closed noetherian domain, F the field of fractions, I a 
fractional D-ideal. Then 


S = {se F|sI c I} =D. 


Proof. Lets e S. Since / is finitely generated and faithful as D-module and s7 c I, s is D- 
integral by the lemma on p. 408. Since D is integrally closed, s € D. Then S c D and since D 


c S follows from the definition of a fractional ideal, we have S = D. 


LEMMA 4. Let D be a noetherian domain in which every non-zero prime ideal is maximal 
and let I be an ideal of D such that IF D. Then I~! # D. 


Proof. Leta +£ 0 be inZ. Then D > I > aD, and by Lemma 2, aD > P} P, ... P,,, where the P; 


are prime ideals # 0. We assume that m is minimal. By the noetherian condition (or by Zorn’s 
lemma) there exists a maximal ideal P > Z. Then we have P > I> aD >P; ... P, Since P and 


the P; are prime, the hypothesis that non-zero prime ideals are maximal implies that P = P, for 
some 7, and we may assume that P = P}. If m = 1 we have aD = J. Then J~! = a` !D and since I 
*D,a '!¢DandI-!=a !D? D. Next assume that m > 1. Then aD ® P, ... P,, by the 
minimality of m and so we can choose a b € P, ... P, € aD. Put c =a lb. Then c ¢ D and cI c 
cP=a ! bPa bPa ! PP, ... Pm ca !(aD)= D. Thus c e I! and c ¢ D, so again we 


have 77! ? D. E 


We can now prove 


THEOREM 10.3. A domain D is Dedekind if and only if (1) D is noetherian. (2)non-zero 
prime ideal in D is maximal. (Equivalently, D has Krull dimension < 1.)(3)D is integrally 
closed. 


Proof. We have seen that (1)-(3) hold for any Dedekind domain. Now assume that D is a 
domain satisfying these conditions and let J be a D-fractional ideal. The fractional ideal J~ ! Z 
is integral and JJ 7 I(r Y |! c D, so L7 I7 Y} c F l. It follows from Lemma 3 that (177 7 
l c D. Since I~! c D, it follows from Lemma 4 that Z7 7 ! = D. Hence every fractional ideal 


is invertible and D is Dedekind. 


Our next characterization of Dedekind domains will be in terms of localizations. Let D be a 
domain, S a submonoid of the multiplicative monoid of non-zero elements of D. We consider 


the subring D, of the field of fractions F of D consisting of the elements as as ~ La~eD.¢eS. 
It is readily seen that this is isomorphic to the localization of D at S as defined in section 7.2. 
Hence the results on localization that were developed in sections 7.2 and 7.3 are available for 
the study of the rings D,. 

Let J be a fractional ideal for D and put J, = {bs ||b € I, s € S}. Then J,is a D,-submodule 
of F, I, # 0, and if a is a non-zero element of D such that al c D, then al, c D,. Since it is 
evident that F is the field of fractions for D,, it follows that J, is a D,-fractional ideal. If 7} and 
I, are D-fractional ideals, then 


(4) (Li +h)s= Is + las, 


(5) (L 1)s = Iisls, 
(6) (LA Ia)s = Lis 0 has. 


The first two of these are clear. To verify the third we note that (7) N D)s € 4s N hs is clear. 
Now let a € Lig N hs, so a = as] | = a82 “| where a; € I, s; € S. Then b = ays, = ays, E€ A N 
I, and a = B(s}sy) ! € (I, N D),. Hence (6) holds. 

We note next that if / is a finitely generated fractional D-ideal, then 


(7) (I~ ")s = (Is). 


For, it follows from the definition of 77! that 7, + 4) !=1,; 11,1. It is clear also that if b + 
0 in F, then (Db), = D,b. Hence if I = Db, + ... + Db,,, then J, = D,b, +... + D,b,, and I, ~! = 
(\™, D,b; ~!. On the other hand, 77! = (| ” Db; ~}, and so by (6), 7 7D, =.",D,b; 1 = Is 71. 
Hence we have (7). 

Now let 7' be a fractional ideal for D,. Then we have an element as lTatz0inD,seS, 
such that as! I' c D,. Then as ~ ! J’ is a non-zero ideal contained in the localization D,. By 
section 7.2 this has the form J, for a nonzero ideal J contained in D. Then J '= a “IsI, = (a 
's)7),. Thus any fractional D,-ideal has the form 7, for some fractional D-ideal 7. 


We can now prove 


THEOREM 10.4. Let D be a domain. 
(1) If D is Dedekind, then D, is Dedekind for every submonoid of the multiplicative 
monoid of non-zero elements of D. 
(2) If D is Dedekind and P is a prime ideal + 0 in D, then Dp is a discrete valuation 
ring. 
(3) If D is noetherian and Dp is a discrete valuation ring for every maximal ideal P in 


D, then D is Dedekind. 
(We recall that if P is prime ideal, Dp = D, where S is the multiplicative monoid D — P.) 


Proof. (1) Suppose that D is Dedekind and let 7' be a D,-fractional ideal. Then /' = J, for a 
fractional D-ideal J. Since D is Dedekind, D is noetherian and hence / is finitely generated. 
Then (7! = I! = (4), by (7) and 77"! = I(T), = U})s (by (5)) = D,. Hence 7 is 
invertible and D, is Dedekind. 


(2) The localization Dp at the prime ideal P + 0 is a local ring whose only maximal ideal is 
P,. Since D, is Dedekind, P, is the only prime ideal # 0 in D,. Hence D, is a p.i.d. by 
exercise 14, p. 605. Then D, is a discrete valuation ring by Proposition 9.2, p. 570. 

(3) Now assume that D is noetherian and that D, is a discrete valuation ring for every 
maximal ideal P of D. Then D, is a p.i.d. and hence D, is Dedekind. Let 7 be a fractional D- 
ideal. Since D is noetherian, Z is finitely generated. Hence J"! =(I"'),. Then(//~ D, = 1,7"), 
= i = Dp forevery maximal ideal P of D. We can now apply the principal of passage from 
local to global: We have the injection i: IJ~ | « D, which localizes to the injection (J T) S Dp 
Since the latter is surjective for all P, it follows from Proposition 7.11.2 (p. 402) that ZZ 7 ! 


D is surjective. Thus Z7 7 ! = D for every fractional D-ideal J. Hence D is Dedekind. 


Dedekind domains can also be characterized by the factorization property given in Theorem 
10.1. In fact, we have the following stronger result: 


THEOREM 10.5. Let D be a domain with the property that every proper integral ideal of 
D is a product of prime ideals. Then D is Dedekind. 


Proof(Zariski-Samuel). We show first that any invertible prime ideal P of D is maximal in D. 
Leta € D, ¢ P. Suppose that aD + P + D. Then aD + P =P; ... P&D + P = Q; ... Q, where 


the P; and Q, are prime ideals. Since aD + P ? P and a*D + P? P (since P is prime), we have 


P;? P, Q, ? P for all i and j. Passing to the domain D = D/P we obtain aD = P. P, 
and aD =Q, ... Q, where x + X is the canonical homomorphism of D onto D. The neal 


ideals aD and a’D are invertible and the P ; and Q, are prime. Moreover, a,D = (aD), so 2, 
-P _P „ By Proposition 10.4, the sequence of prime ideals {@,, ..., 2,3 


and i: b P ji Meee P PA P my are the same except for order. The corresponding 
sequences of ideals of D containing P are {Q}, ..., Q,} and {P}, Pi; ..., Pm Pmt- Hence these 
coincide except for order. This implies that (aD + P)? = P? ... P2 = Q; ... Q, =&@D +P. 


Then P c (aD + P? = &°D + aP + P? c aD + P’. Then ifp € P, p = ax + y where x € D and y 
€e P°. Then ax € P and since a ¢ P, x e P. Hence P c aP + P cP, so P = aP + P. 


Multiplication by P`! gives D = aD + P, contrary to hypothesis. Thus we have aD + P = D for 
every a € D, ¢ P. This implies that P is a maximal ideal of D. 


Now let P be any prime ideal + 0 in D and let b be a non-zero element of P. Then P > bD 
and bD = P,P, ... P, where the P, are prime and are invertible. Hence the P; are maximal. 
Since P is prime, we have P > P; for some i. Then P = P; and so P is invertible. Since any 
proper integral ideal is a product of prime ideals, it follows that every integral ideal is 
invertible. Since any fractional ideal Z ' = al for I integral and a + 0, I 'Q Y! = I7! = D. 


Hence every fractional ideal is invertible and D is Dedekind. 
The following theorem summarizes the characterizations of Dedekind domains that we have 
given. 


THEOREM 10.6. The following conditions on a domain are equivalent: 


(1) D is Dedekind. 

Gi) D is integrally closed noetherian and every non-zero prime ideal of D is maximal. 

(Gii)Every proper integral ideal of D can be written in one and only one way as a 
product of prime ideals. 

(iv) D is noetherian and Dp is a discrete valuation ring for every maximal ideal P in 


D. 


EXERCISES 


— 


. Prove that any factorial domain is integrally closed. 


2. Prove that the following conditions characterize Dedekind domains: D is integrally 
closed noetherian and D// is artinian for every integral ideal Z. 


3. (H. Sah.) Show that a domain D is Dedekind if and only if it has the following property: 

If J is any integral ideal of D and a is any non-zero element of /, then there exists a b € 
I such that J = Da + Db. 

(Sketch of proof: The proof that any Dedekind domain has this property was indicated 
in exercise 11, p. 605. Now suppose that D has the property. Then D is noetherian. Also 
it is easily seen that the property is inherited by any localization D,. Now suppose that D 
is a local domain having the property and let P + 0 be its maximal ideal. Let J be any 
ideal + 0 in D. Then J = IP + Db for some b e J. It follows by Nakayama’s lemma that 7 
= Db. Thus D is a p.i.d. and hence a discrete valuation ring (p. 571). Now conclude the 
proof by using Theorem 10.4.3 (p. 628). 


Exercises 4-7 were communicated to me by Tsuneo Tamagawa. The first of these is well 


known. In all of these exercises, D is Dedekind with F as field of fractions and x is an 
indeterminate. 


4. If f(x) € Dix], we define the content c(f) to be the ideal in D generated by the 
coefficients of f(cf. BAI, p. 151). Prove that c(fg) = c(/)c(g) for f, g € Dix]. 


5. Let S= {fe D[x]|c() =D}. Note that Sis a submonoid of the multiplicative monoid. Let 
Dix], denote the subring of F(x) of fractions f(x)/g(x) where f(x) € D[x] and g(x) € S. 
Show that if f(x), g(x) e Dix], then f(x)/g(x)e D[x], if and only if c(f) c c(g). 


6. Show that D[x], N F =D. Let I’ be an ideal in D[x],. Show that Z =Z' N D is an ideal in 
D such that ZD|x], =Z '. Show that I+ /' = [D[x], is a bijective map of the set of ideals 
of D onto the set of ideals of D[x],. 


7. Prove that D[x], is a p.1.d. 


10.3 INTEGRAL EXTENSIONS OF DEDEKIND DOMAINS 


The classical examples of Dedekind domains are the rings of 2-integers of number fields, that 
is, of finite dimensional extension fields of @ and of fields of algebraic functions of one 
variable. In this section we derive the general result that if D is Dedekind with field of 
fractions F and Æ is a finite dimensional extension field of F, then the subring D' of D-integral 
elements of £ is Dedekind. This implies the Dedekind property of the classical domains and 
gives the fundamental theorem on the unique factorization of integral ideals as products of 
prime ideals in these domains. 
We prove first 


PROPOSITION 10.7. Let D be a domain, F its field of fractions, E a finite dimensional 
extension field of F, and D' the subring of E of D-integral elements. Then any element of E 


has the form rb! where r e D'and be D. 


Proof. Letu e E and let g(x) = x” + ax®7 | + ... + a„ € F[x] be a polynomial such that 
g(u) = 0. We can write a; = c;b ~ |, c, b € D. Then 


h(x) = b"g(b~*x) = b"(b7™x" + a,b ™ Dx"! ge tay) 


= X" + bax"! + bax"? + + ban 


€ D[x] and A(bu) = 0. Hence r = bu € D' and u = b7 !r as required. E 

Evidently this result implies that Æ is the field of fractions of D'. Since D' is the ring of D- 
integers of E, D' is integrally closed in E. Hence D' is an integrally closed domain. 

We suppose next that D is integrally closed and we prove 


PROPOSITION 10.8. Let D, E, F, D ' be as in Proposition 10.7 and assume that D is 
integrally closed. Let r € E and let m(x),f(x) be the minimum polynomial and characteristic 
polynomial respectively ofr over F. Then the following conditions on r are equivalent: 

(1) r is integral. 

(2) m(x) € D[x]. 

(3) f(x) € Dix]. 

If the conditions hold, then Tp;p(r) and Ng Ar) € D. 


Proof. Since f(x) is a power of m(x), it is clear that (2) = (3). Also obviously (3) = (1). Now 
assume (1). Let F be an algebraic closure of F containing E. We have m(x) = I I(x — r;) in F[x] 
where r; = r. For any i we have an automorphism of F/F sending r into r; Since r is D- 
integral, there exists a monic polynomial g(x) € D [x] such that g(r) = 0. Applying an 
automorphism of //F sending r into r;, we obtain g(r) = 0. Hence every root r; of m(x) is a D- 
integer. Since the coefficients of m(x) are symmetric polynomials in the r, these elements of F 
are D-integers and since D is integrally closed, they are contained in D. Thus m(x) € D[x] and 


(1) +(2). The last assertion is clear, since f(x) =x" — TyyAr)x” | +... H DN g(r). 


A key result for the proof of the theorem on integral closures of Dedekind domains is the 
following 


PROPOSITION 10.9. Let D, E, F, D' be as in Proposition 10.8 and assume that E/F is 
separable and D is noetherian. Then D' is a finitely generated D-module. 


Proof. Since E/F is separable, there exist n = [E:F] distinct monomorphisms oj, ..., ¢, of 
E/F into the algebraic closure F/F. Moreover, if (u, ..., u,,) is a base for E/F, then the matrix 


(U1) G; (Un) 
(8) fe (us) 02 (Un) 
On(Uy) °  On(Un) 


is invertible (BAI, p. 292). We know also that T;,,(u) = >)", o; (u) for u e E (BAI, p. 430). 
We have (‘A)A = (Tpjr(uju;)). Hence 


(9) d = det (Tg)r(uiu;)) # 0. 


We have seen in Proposition 10.7 that there exists a b; # 0 in D such that b;u; € D'. Hence we 


may assume that the u; € D'. Letr e D' and write r = }” jaju; a; € F. Then ru; € D', 1 <j <n, 


and ru; = )/a;uju;. Hence 
(10) Try p(ru;) = A di Te) p(ujuj) € D. 


The system of equations (10) for a;, 1 < i <n, can be solved by Cramer’s rule. This shows that 
a; € d! D. Since this holds for every r € D', we see that D' is contained in the D-module M = 
>” D(d ! u). Thus D' is contained in a finitely generated D-module. Since D is noetherian, M 


is a noetherian D-module. Hence the submodule D' is finitely generated. E 


We can now prove the main result of this section. 


THEOREM 10.7. Let D be a Dedekind domain, F its field of fractions, E a finite 
dimensional extension field of F, D' the subring of D-integral elements of E. Then D' is 
Dedekind. 


Proof. We assume first that E is separable over F. In this case we shall prove the theorem by 
showing that D’ has the three properties characterizing Dedekind domains given in Theorem 
10.3, namely, (1) D' is noetherian, (2) every prime ideal # 0 in D' is maximal, and (3) D' is 
integrally closed. The first of these follows from Proposition 10.9, since this proposition 
implies that D' is a noetherian D-module. A fortiori D' is noetherian as D'-module. Now let P’ 
be a non-zero prime ideal in D'. Let r + 0 be in P’. Then we have f(r) = 0 for the characteristic 
polynomial f(x) ofr. This implies that Ng;p (r) € P' and hence, by Proposition 10.8, Ngp(r) € 
P=P' N D. Since Nz Ar) + 0, we see that P # 0. It is clear also that P is a prime ideal in D. 
Since D is Dedekind, this implies that P is a maximal ideal in D. By the Corollary to 
Proposition 7.17, p. 410, we conclude that P’ is maximal in D'. Hence (2) holds. The remark 
following Proposition 10.7 shows that (3) holds. Hence D’ is Dedekind in the separable case. 


We assume next that char F = p and that E is purely inseparable over F. Since [E:F] < ~, 
there exists a q = p° such that uv” €e F for all u € E. Let F be an algebraic closure of F 
containing E and let F'!/7(D!%) be the subfield (subring) of F of elements v such that v4 € F(D). 
Then v > v4 is an isomorphism of F4 onto F mapping D!7 onto D. Since D is Dedekind, D'4 
is Dedekind. If u € D', u1 € F and uf is D-integral. Hence u1 € D and so D' c D"4. We shall 
now show that D' is Dedekind by proving that every D’-fractional ideal J of EF is invertible. Let 
I' = D41. Then T' is a D'4-fractional ideal. Hence there exists a D!/7-fractional ideal J’ such 
that J' J' = D"4 and since I' = ID"4 and D"4J' = J', we have elements b; € J, c'; € J' such that 
bc; = 1. Then ¥b1c 4 = 1 and Yd,c; = 1 where c; = 54! cf. Now bf 1 ce'a e PTI CT. 
Since c’4€ F and b; € E,c,;=b;T! c'1 € E. Thus c; e J=J' N E. Evidently LJ c D4 N E and 
since the elements of D4 are D-integral, D4 N E c D'. Hence IJ c D', so Jc T | Since the c; 
e Jand bje I and Yc;b; = 1, we have II ! = D'. Thus 7 is invertible and D' is Dedekind. 


The general case can be obtained by combining the preceding two cases in the obvious 
way: If E is arbitrary finite dimensional over F, let K be the maximal separable subfield of 
E/F. Then E is purely inseparable over K. Now D” = D' N K is Dedekind by the first case and 
since D' is the set of D"-integral elements of E£, D' is Dedekind by the second case. This 


completes the proof. 


The foregoing result is applicable in particular to F = @ and D = Z and to F = K(x) where K 
is an arbitrary field, x an indeterminate, and D = K[x]. This is clear since these D are p.i.d. and 
hence are Dedekind. If E is a finite dimensional extension field of F, then the ring D’ of D- 
integers of E is Dedekind. In the case F = 9, D = Z, E is a number field and in the other cases £ 
is an algebraic function field in one variable. 


EXERCISES 


1. Let the notations be as in Theorem 10.7. Assume that E/F is separable and that D is a 
p.i.d. Show that D' is a free D-module of rank n = [E: F]. 


Exercises 3, 4, and 5 of BAI, p. 287, are relevant for this section. 


2. (Sah.) Show that if D is a Dedekind domain, then so is the ring D((x)) of Laurent series 
over D. (Hint: It is easily seen that D((x)) is a domain. If J is an ideal in D((x)), let J ) 
be the set of leading coefficients of the elements of 7. Then /(/) is an ideal and if /(/) = 
Da, + ... + Da,, where a; is the leading coefficient of f; e J, then J = D((x))f, + ... + 


D((x))f,,- Use this and exercise 3 on p. 630. 


10.4 CONNECTIONS WITH VALUATION THEORY 


We give first a valuation theoretic characterization of the integral closure of a subring of a 
field. This is 


THEOREM 10.8 (Krull). Let D be a subring of a field F and let D' be the integral closure 
of D in F (= the subring of D-integral elements of F). Then D' is the intersection of the 
valuation rings of F containing D. 


Proof. Letu e D'and let R be a valuation ring in F containing D, g a valuation of F having 
R as valuation ring (e.g., the canonical valuation defined by R). Suppose that u ¢ R. Then g(u 
1) < 1. On the other hand, we have a relation u” + aju” ~ L4.. a, =0, a; € D, which gives 
the relation 


n 


= ee | so 
(11) | = -aqu t- au? —a,u™. 


Since the a; € D c R, o(a) < 1. Then (aw Í) < 1. Since ọ(1) = 1, we have the contradiction 1 
< max{o(aju =D} < 1. Hence u € Rand D' c N R where the intersection is taken over all of the 
valuation rings R of F containing D. Next, suppose that u é D'. Then uw“! is not a unit in the 
subring D[u }] of F, since otherwise, its inverse u = ag + aw! +... + a,u "7D, a; € D, and 


l is not a unit in D[w!], u 


hence u” = agu"! + aju” 7? + ... + a, contrary to u éD '. Since u` 
-l D[w!}] is a proper ideal in D[w !] and this can be imbedded in a maximal ideal Py of D[u" 
1]. Then 4 = D[w ']/Po is a field and we have the canonical homomorphism of D[w '] onto a, 
which can be regarded as a homomorphism @) of D[u"'] into an algebraic closure A of 4. By 
Theorem 9.10 (p. 561), there exists a A-valued place # extending @. If R is the valuation ring 
on which @ is defined and P is its maximal ideal, then P is the kernel of #. We have R > Diu] 
> D and P > Po. Then w` ! D[w !]c Py c P;, so Au ') = 0 and hence u ¢ R. Thus we have 


shown that if u é D' then u é [l R where R ranges over the valuation rings of Fcontaining D. 


Hence D' =f R. 


Now let D be Dedekind, F its field of fractions. Since D is integrally closed, D is the 
intersection of the valuation rings of F containing D. We claim that these valuation rings are the 
localizations Dp, for the prime ideals P of D. We have seen that every Dp is a discrete 


valuation ring for every prime ideal # 0 in D. This is clear also if P = 0, since Dy = F. Now let 
R be a valuation ring in F containing D such that R # F. Let M be the maximal ideal of R and 
let P= M N D. Then P is a prime ideal in D and if P = 0, every nonzero element of D is 
invertible in R. Since F is the field of fractions of D, this implies that F = R, contrary to 
hypothesis. Thus P is an integral prime ideal in D. We have the valuation ring Dp with maximal 
ideal P, = PD,. Ifa € D- P, thena € R- M anda l€ R. Then R > Dp. Moreover, since M 
D P, M = RM > D,P. Now it is easily seen that if R; and R, are valuation rings with maximal 
ideals M, and M, respectively, then R; D R, and M, > M, imply R, = R, and M, = M>. Hence 
R = D, and M = PD,. Thus the valuation rings of F containing D (and #F) are just the 
localizations D,, P a prime ideal 4 0 in D. By Theorem 10.8, D = pi Do 


A prime ideal P # 0 in a Dedekind domain D determines an exponential valuation v, for the 


field of fractions F of D in a manner similar to the definition of the exponential valuation v, on 
a defined by a prime integer p. Ifa £ 0 in F, we write Da = P% P," ... Pf! where P, P}, ..., P} 
are distinct prime ideals and k and the k; e Z. Then we put v,(a) = k. Moreover, we define 
v,(0) = œ. It is readily verified, as in the special case of 9, that v, is an exponential valuation 
of F. Hence if we take a real y, 0 < y < 1, then |a|, = y (a) is a non-archimedean absolute 
value on F. We call this a P-adic absolute value on F. The valuation ring of | |, evidently 
contains the localization Dp and the maximal ideal of the valuation ring contains PD,,. Hence 


the valuation ring of | |, is D, and its maximal ideal is PD,,. 


Since D, is Dedekind and PD, is its only prime ideal + 0, the integral ideals of D, are the 
ideals Py k >0. Evidently, P, c PD, N D. On the other hand, if a € PD, 1 D for k= 1, 
then a = bc ! where b e P*, c e D -— P. Then ac = 0 (mod P*) and c #0 (mod P). The latter 
condition implies that c + P is a unit in D/P. Then c + P* is a unit in D/P* (p. 195). It follows 
that a e P*. Hence PXD, N D=P* for k>0. 

Let | | be a non-archimedean absolute value having D, as valuation ring. We claim that | | = | | 
p» a P-adic valuation determined by the prime P. To see this we observe that PD, = x D, 
where x € PD,, # P’D,. Then P“D, = xD, for k > 0 and any element of D, can be written as 
un*, u a unit in D, and k > 0. Then |uz*| = yf where |x| = y, 0 < y < 1. Hence, ifa e P4, é PX*!, 
then a € "D; ¢ Pk* 1Dp, and |a| = y*. On the other hand, it is clear that Da = P% P% ... PÉ 
where P, ..., P| are primes distinct from P and hence if | |p is the P-adic valuation determined 
by y, then |a|, = |a| for a € D. Then||=||,. 

We show next that the residue field D,/P, of | |p is canonically isomorphic to D/P. Since P 
=F, N D, we have the monomorphism a + P + a + Fi of D/P into DP Ifa, b e Dand b é P, 
then since D/P is a field there exists a c € D such that cb = a (mod P). Then ab" ! = c (mod 
P,), which implies that a + P + a + P, is surjective on D,/P,. Then this map is an 
isomorphism. 

We summarize this collection of results in 


PROPOSITION 10.10. Let D be a Dedekind domain with field of fractions F. Then (1) Any 
valuation ring in F containing D has the form D, where P is a prime ideal in D. (2) Any 


non-archimedean absolute value on F having Dp, P # 0, as valuation ring is a P-adic 
valuation. (3) The map a + P ~a + P, is an isomorphism of D/P onto D,/P,,. 


Now let E be a finite dimensional extension field of F and let D' be the ring of D-integers of 
E. Then D' is Dedekind. If P is an integral ideal of D, then the ideal in D' generated by P is D' 
p. Now D' P + D'. For, by the “Lying- over” theorem (p. 411), there exists a prime ideal P’ in 
D' such that P' N D = P. Hence P' > PD' and PD' + D'. It may happen that the extension ideal 
PD' of P is not prime. At any rate, we have a factorization 


(12) PD! = Preis» Pre 


where the P’; are distinct prime ideals in D' and the e; > 0. 

We have treated the general problem of extension of an absolute value on a field F to a 
finite dimensional extension field in Chapter 9 (p. 585). We shall now treat the special case of 
a P-adic absolute value of the field of fractions F of a Dedekind domain in an alternative 
manner based on the factorization (12). We prove first 


THEOREM 10.9. Let D be a Dedekind domain, F the field of fractions of D, E a finite 
dimensional extension field of E, D' the subring of E of D-integral elements. Let P be an 


integral prime ideal in D, | |p a P-adic absolute value on F, (12) the factorization of PD' into 
prime ideals in D'. Then for each P'; there is a unique P'-adic absolute value | p’, on E 


extending | |,. Moreover, | |p; and | |,, are inequivalent if i + j and the | are the only 


bi bi þri; 


extensions of | |p to absolute values on E. 


Proof. Leta £ 0 be in F and write Da = P* P*, ... Při; where P and the P; are distinct 
prime ideals and k and the k; € Z. Then vp(d) = k, and by (12), 


D'a = (D' P} (D'P,)"' + (D'P) 
a pat P£: war Pied pen 


where the terms after P’,e,k come from the factorizations of the D'P, into prime ideals in D'. 


(13) 


Now we observe that if P} and P, are distinct prime ideals in D, then P; + P, = D and hence D' 
P, + D' P, = D'. This implies that the prime ideals in D' dividing D'P, are different from those 
dividing D'P}. It follows that vp (a) =e;k. Hence if | |, = y YO for 0 <y < 1, then we shall have 
et b, on F if and only if |y 


the valuation ring of | b, (or of vp) is Dy. Since these are distinct for distinct choices of j, it 


= (Pvp ). This proves the first assertion. We have seen that 


follows that | pr and | b’, are inequivalent if i # 7. Now let | |’ be any absolute value on E 
extending | |p and let R’ denote the valuation ring of | |’. Then R’ > D, and since D' is the integral 
closure of D in E, R' > D' by Theorem 10.8. By Proposition 10.10.1, R' = D'y for P’, a prime 
ideal in D'. Since R' > D,, it follows that P' > P. Hence P' = P’; for some i. By Proposition 


10.10.2, | | is a P';-adic valuation. E 


We recall that the ramification index of | | n relative to | |, is the index CIE® | r::|F* |>] (p. 589). 
Since | F* | p is the subgroup of R* generated by y and Ep, is the subgroup generated by y"®i, 
the ramification index is e;. 

We recall also that the residue degree f; of | |, relative to || , is the dimensionality [D' p/P 
‘ip, ` Dp/Pp] where Dp/P, is imbedded in D',/P'ip by the monomorphism x + P, ~x + Plipy P. 
589). We also have a monomorphism a + P ~a + P', a € D, of D/P into D'/P',. Hence we 
can define the dimensionality [D’P', : D/P]. Now we have the isomorphisms o : a + P~a + 
P, ands: x + P;~x + P'p of D/P onto D/P, and of D'p, onto D',, /P'iy'; respectively. We 
have s((a + P)(x + P')) = s(ax + P') = ax + P and hence(o(a + P))(s(x + P')) = (a + P,) 
(x + P'ip,') = ax + P',,. Hence s is a semi-linear isomorphism of D'/P'; over D/P onto D',,/P 


‘ip, With associated field isomorphism o. It follows that the residue degree f; coincides with the 


dimensionality [D'/P'; : D/P]. 
The results on ramification index and residue degree can be stated as follows: 


THEOREM 10.10. Let the notations be as in Theorem 10.9. Let | |, be a P-adic absolute 
value on F, | |p; a P';;-adic absolute value on E extending | |p. Then e; is the ramification 
index and [D'/P', : D/P] is the residue degree of | | p, relative to |p: 


An immediate consequence of this result and the main theorem on residue degrees and 
ramification indices (Theorem 9.15, p. 592) is 


THEOREM 10.11. Let the notations be as in Theorem 10.9. Then Lies < 
n= [E:F] iffi = (D/P: D/P]. Moreover, ¥8 ef, =n if EIF is separable. 


We remark that this result has the consequence that the number g of prime ideals of D' 
containing D'P is bounded by n and it gives considerably more information on the factorization 


(12). 


EXERCISES 


1. Let E be the cyclotomic field Q(¢) when ¢ is a primitive /th root of unity, / a prime. Let D 
= 2, D' be the integral closure of Z in E. Show that /D' = ((€— 1)D'P ~! and (€ — 1)D' is a 
prime ideal in D'. (Hint: [E : QO] =1—1 and f(x) =x! +x? +... +1 is the minimum 
polynomial of č. We have f(x) = i I(x — £1) in E[x] and every č „ 1 <i</-l,isa 
primitive /th root of 1. (See BAI, p. 154.) Substituting 1 in f(x) gives 7 = [117 10 - &%). 
Moreover, (1 — &)/(1 -—2S)=14+6+...+8 | e D'. Show that 1 + €+... + 1 is a unit 
in D' so ID’ = ((£ — 1)D’)'~ |. Show that / is not a unit in D' and use Theorem 10.11 to 
conclude that (€ — 1)D' is prime.) 


The next three exercises are designed to show that in the special case of Theorem 10.11 in 
which E/F is Galois, the factorization (12) of PD’ = (P", ... P’,)© and all of the residue degrees 
are equal. Hence n = [E : F] = efg. 

2. Let E/F be Galois with Gal E/F = G. Show that any o € G maps D' into itself. Show that 
if P' is a prime ideal in D' then oP’ is a prime ideal. Show that ë : a+ P' ~oa + oP' is 
an isomorphism of D'/P' onto D'/oP". 

3. Let P be a non-zero prime ideal of D, PD' = P'|"! ... P's, the factorization of PD! into 
prime ideals in D' where P’; # P’; ifi #7. Show that ifo € G, then {oP'), ..., oP’g} = {P 
', +++, P’g and that G acts transitively on the set of prime ideals {P%, ..., P’,}. (Hint: To 


prove transitivity of G on {P';, ..., P's} one shows that if P’ is any prime ideal of D' 
such that P' N D = P then any prime ideal Q' of D' such that O'M D = P is one of the 
ideals o P’, o e G. Otherwise, by Proposition 7.1, p. 390, Q U, < G © P’. Hence there 
exists ana € Q' such that oa ¢ P' for any o € G. Then Nga) =l] - go a éP This is a 
contradiction since Ngpla)e O' N D=P.) 


4. Use the results of exercises 2 and 3 to show that e; = e, for every i and j and that the 
fields D'/P', and D'/P'; are isomorphic over D/P. Hence conclude that [D'/P'; : D/P] = 
[D'"/P'; : DIP]. Put e = e; f= fi = [D'P; : D/P]. Show that n = efg. 


10.5 RAMIFIED PRIMES AND THE DISCRIMINANT 


As in the previous section, let D be a Dedekind domain, F its field of fractions, Æ a finite 
dimensional extension field of F, and D' the subring of D-integral elements of E. Let P be a 
non-zero prime ideal in D and let PD' = P'|"! ... P’,°s where the P’; are distinct prime ideals in 


D, as in (12). We consider the ring D = D'/PD', which we can regard also as a module over D' 
and by restriction as a module over D. Since P annihilates D’, D’ can also be regarded as a 
vector space over D = D/P. The action of a+ P, a € D, onx + PD', x € D', is (a + P)(x + PD') 
=ax + PD’. Taking into account the ring structure as well as the D -vector space structure, we 
obtain the algebra D’ over D. We shall now consider the structure of this algebra. The main 
result on this is the following 


THEOREM 10.12. Let D be a Dedekind domain with field of fractions F and let E be a 
finite dimensional extension field, D' the subring of E of D-integral elements. Suppose that 
P is a non-zero prime ideal in D and let PD' = P',"! ... P' £4 where the P'; are distinct prime 
ideals in D' and the e; > 0. Then D = D'/PD' is an algebra over the field D = D/P in which (a 
+ P)(x + PD’) = ax + PD',a € D,x € D'. The dimensionality [D : D] = >. )8ej; where f; is the 
residue degree | D'/P’,: D]. We have [D : D] < n = [E : F] and [D : D] =n if E/F is 
separable. D' is a direct sum of g ideals isomorphic to the algebras D'/P':. The radical of D’ 
is P"; ... P',/PD! and D'/rad D' is a direct sum of g ideals isomorphic to the fields D'/P',. 


Proof. Put 4&5 Pi “Pi Po", Then 
(14) 24A=D,; Ain yy A =D'P. 
jżi 


This implies that 


(15) D' = D'/PD' = A\/PD'®:*:@ Ai/PD' 


and 4i/PD' = AINA; A Pi*) = (Ai+ Pi*)/Pi" = D'/Pi*. Hence 
(16) D' = D'/P EO @D'/P,”. 
We have the chain of ideals 
(17) Da>Pi>P?P >> Pi*, 1<i<g, 
each of which is minimal over the next. Hence 
(18) D'/Pi* = PiP == > PP 0 


is a composition series for D'/P',*/ as D'-module. The composition factors are isomorphic to D 
fia Pipe. ere ee Lip ey, each of which is annihilated by P’; and so can be regarded as a 
vector space over the field D’/P’;. Since these vector spaces have no non-zero subspaces, they 
are one-dimensional over D’/P’;. Hence the composition factors are isomorphic as D’-modules 
and hence also as D-modules and as vector spaces over D. Then [P'} 7 '/P’J : D] =[D'/P’;: D] 
= f; for j = 1, ..., e, It follows from (18) that [D/P’” : D] = ef; Since 
D'/D'P = D'/PY'@-- @D'/P; we have [ D’: D ] = 2,8e,f;. We have shown in Theorem 10.11 that È 
ef; <n and È ef;=n if E/F is separable. Hence [D’: D] <n and =n if E/F is separable. 

Let e = max{e;}. Then (P’; P’y ... P'a)? c D'P. Hence P’, ... P’,/D'P is a nilpotent ideal in 
D = D'/D'P and DP °*: Po/D'P) = D'/Pi + P; = D'/P1 © ®D'/Pa, Since every D'/P'; is a field, it 
follows that P’, ... P',/D'P = rad D’ and D’/rad D’ is a direct sum of g ideals isomorphic to the 


fetds Dip", LI 


We now give the following 


DEFINITION 10.4. A non-zero prime ideal P of D is said to be unramified in D' if D'P = P’, 
... P', where the P', are distinct prime ideals in D' and every D'/P'; is separable over D = 
D/P. Otherwise, P is ramified in D'. 


It is clear from Theorem 10.12 that P is unramified in D' if and only if Ò’ = D'/D'P is a 
direct sum of fields that are separable over D = D/P. We shall show that if E/F is separable, 
then there are only a finite number of prime ideals P of D that are ramified in D’ and in 
principle we shall determine these prime ideals. For this purpose we give the following 


DEFINITION 10.5. Let D, F, E, D' be as usual and assume that E/F is separable. Then the 
discriminant (ideal) dpp of D' over D is the ideal generated by all of the elements 


(19) det (Tr)r(uitt;)) 
where (uy, ..., U,) is a base for EIF consisting of elements u;e D'. 


We have seen in the proof of Proposition 10.9 (p. 612) that det(T;)-(u,u;)) # 0 for every base 
(uj, ..., U„) of a separable extension E/F. The main result on ramification is 


THEOREM 10.13. Let D be a Dedekind domain, F its field of fractions, E a finite 
dimensional separable extension of F, D' the subring of E of D-integral elements. Then a 
non-zero prime ideal P of D is ramified in D' if and only if P is a divisor of the discriminant 


dpp- 


Of course, this implies the finiteness of the set of ramified primes. To achieve the proof of 
Theorem 10.13 we need to obtain a trace criterion that a finite dimensional commutative 
algebra A over a field F be a direct sum of ideals that are separable field extensions of F. 


If A is a finite dimensional algebra over a field F, we define the trace bilinear form T(a, b) 
on A/F by 


(20) T(a, b) = T4)p(ab) 


where T4;p is the trace defined by the regular representation. Since T4;p is a linear function and 
T4;r(ab) = T4,ba), T(a, b) is a symmetric bilinear form (BAI, pp. 424, 426). We recall also 
that 7(a, b) is non-degenerate if and only if det(T(u;, u;)) # 0 for any base (u;, ..., u,) of A/F. 


We require the following 


LEMMA. Jf A is a finite dimensional commutative algebra over a field F, then A/F is a 
direct sum of ideals that are separable fields over F if and only if the trace bilinear form 
T(a, b) is non-degenerate on A. 


Proof. Suppose first that 4= E197: ®Es where E; is a separable field extension of F. Let 


(uiis ---» Ujn,) be a base for E;. Then (up ..., Uin, -- ) is a base for A/F and 


a Ug], EEE Uon, 


(2 1) det (T (T( Uik, Uj) =I det (T Uik, U Uit)) 


where T; denotes the trace bilinear form on E;. Since we have shown (p. 612) that 7; is non- 
degenerate, the foregoing formula shows that T is non-degenerate on A. 

Conversely, suppose that T is non-degenerate on A. We show first that Æ is semi-primitive. 
Let z e rad A. Then za = az is nilpotent for every a € A. Since the trace of a linear 
transformation is the sum of its characteristic roots, the trace of a nilpotent linear 
transformation is 0. Hence 7(a, z) = 0 for all a. Since T is non-degenerate, z = 0. Thus rad 4A = 
0 and A is semiprimitive. Then 4 = £1 ®°"®"s where E; is a field extension of F. By (21), the 


trace bilinear form T; is non-degenerate on every E; On the other hand, if E/F is a/finite 


dimensional inseparable extension field of F then the trace function is identically 0 on Æ. For, 
E? S where S is the maximal separable subfield of E. Then £E is purely inseparable over S. It 
follows that there exists a subfield K of E containing S such that [E : K] = p the characteristic. 


Then the minimum polynomial over K of any a € E is either of the form x? — a or of the form x 


— a. In either case the characteristic polynomial is of the form x? — p (6 = a? in the second 
case). Then 7;,,(a) = 0 for all a. By the transitivity of the trace (BAI, p. 426) we have Tpp(a) 


= TAT rx(a)) = 0. It follows that the trace bilinear form on an inseparable field is identically 


0. Hence every £; is separable. E 


We can now give the 


Proof of Theorem 10.13. We assume first that D is a p.i.d. By Proposition 10.9, D’ is finitely 
generated as D-module. Since there is no torsion and D is a p.i.d., D' is a free D-module. Since 
every element of E has the form?” 'w where r € D and u € D’, the rank of D' over D=n=[E: 
F]. Let (v, ..., Vn) be a D-base for D’. Then det(7(v,, v;)) generates dpp, so we have to show 


1 

that D’ is a direct sum of separable field extensions of D if and only if Pdoes not contain the 
element det(7(v;, v;)). Since (v, ..., v,) is a D-base for D’, PD' = X Pv; and if P; = v; + PD’, 
then (P4, ..., Pn) is a base for D' over D. It follows that if a ~ p(a) is the regular matrix 
representation of E/F obtained by using the base (vy, ..., v,), then the regular matrix 
representation of D'/D using the base (P4, ..., Pn) is a ~ Pa) (4 = a + PD’). This implies that 
det(7(V;, ¥;)) = det(7(v;, v;)) + P. Hence by the Lemma, D’ is a direct sum of separable fields 
over D’ if and only if det (7(v;, V;)) é P. This completes the proof in the case in which D is a 
p.i.d. 

The proof for arbitrary D can be reduced to the p.i.d. case by localizing at the prime P of D. 
Since we are interested in localizing D' as well as D and since P is not a prime ideal in D’, we 
make explicit the localizing monoid as S= D — P. We form D, and D',. By Proposition 7.16, D 


' 1s the integral closure of D, in E. Since P’; N D = P, P' N S= Ø. Hence P’;, is a prime ideal 
in D',. We have P, D', = P'),°1 ... P'g.“e and the P’,, are distinct. Since P's N D' c P'p' N D' = 
P',, we have P’;,  D' = P’;. It follows as in the proof of Theorem 10.10 that we have a semi- 
linear map of D’,/P’;, over D,/P, onto D'/P’; over D/P, which is a ring isomorphism. Hence D 
' /P',, is separable over D,/P, if and only if D'/P’; is separable over D/P. Thus P is ramified in 
D' if and only if P, is ramified in D',. 

We show next that dp /D, = (dpyp)s. Let(uj, ..., Un) be a base for E/F such that the u; € D'. 
Then u; € D', and det(7(u;, u;))€ dpp, Hence dpyp C dp yp, and (dpyp), © apy yp,- Next let 
(u'i, ..., U'„) be a base for E/F such that the u'; € D’,. Then the u’;=u;s |, u; € D', se D- P, 
and det(7(u',))= s~” det(T(u;, u;)) € (dpyp)s- Hence dp jp © (dpp)s 80 dp jp, = Apr yp, It is 


clear also that P is a divisor of dpp if and only if P, is a divisor of (dpyp)s = dp jp,. We have 
therefore achieved a reduction of the theorem to the rings D, and D’,. Since D, is a p.i.d., the 


result follows from the case we considered first. Ld 


EXAMPLE 


Let m be a square-free integer, E = Q(» m), and let D' be the integral closure of Z in E£. Then D' 
has the base (1, V ™) over Z ifm = 2 or 3 (mod 4) and D' has the base (1,(1 + v ™)/2) over Z if 
m = | (mod 4) (exercises 4 and 5, p. 287 of BAI). Hence in the first case, the discriminant dp 
z İs the principal ideal generated by 

2 0 


0 2m 


= 4m. 


Hence the ramified primes (which are the primes p in Z such that 2, is a square of a prime 
ideal in D’) are 2 and the prime divisors of m. 
In the second case (m = 1 (mod 4)), dpz is the principal ideal generated by 


Hence the ramified primes are the prime divisors of m. 


EXERCISES 


1. Notations as in the example. Show that if p is odd and pD unramified in D’, then pD' is a 
prime or a product of two distinct prime ideals in D’ according as the Legendre symbol 
(m/p) =— 1 or = 1. Show that if m = 1 (mod 4), then 2D' is prime or a product of two 
prime ideals in D' according as m = 5 or m = 1 (mod 8). (Hint: Use Theorems 10.9 and 
9.14.) 

2. Let E be the cyclotomic field @(¢), € a primitive /th root of unity, / a prime, as in exercise 
1, p. 618, which showed that (€ — 1)D’ is prime in D'. Show that [D‘/(¢ — 1)D' : 2//z] = 1 
and hence that |D/(¢ — 1)D'| =1 Show that D’ = 2[¢] + (€ — 1)D’ Use exercise 2, p. 276 of 
BAI, to show that |det 7(¢', F| = 1-7 if 0 < i,j <1 — 2. Hence show that D' c + DZE] 
and //*+2)D’ c 2[£] c D' Use these results to prove that D' = 2[£] and that /2 is the only 
prime ideal of Z that is ramified in D’. 

3. Show that a finite dimensional commutative algebra A over a field F is a direct sum of 
separable fields if and only if A; is semi-primitive for every field extension E of F. (The 
latter property was discussed on p. 374 where it was called separability.) 


10.6 FINITELY GENERATED MODULES OVER A DEDEKIND 
DOMAIN 


In this section we shall extend the theory of finitely generated modules over a p.i.d. that we 
treated in BAI (sections 3.6-3.9, pp. 179-194) to finitely generated modules over a Dedekind 
domain. We recall that if M is a module over a domain D, the subset T of elements x e M for 
which there exists an a £ 0 such that ax = 0 is a submodule called the torsion submodule of M. 
M is called torsion free if T = 0. Any submodule of a torsion-free module is torsion free. 
Hence, since any free module is torsion free, any submodule of a free module is torsion free. It 
is clear also that if T is the torsion submodule of a module M, then M/T is torsion free. We 
shall derive next a pair of less obvious results on torsion-free modules. 


PROPOSITION 10.11. Any finitely generated torsion-free module M over a domain D is 
isomorphic to a submodule of a free module of finite rank. 


Proof. Let {u}, ..., u,} be a set of generators for M. We assume M + 0 and u; # 0. We may 
assume that we have an s, 1 < s, < r such that {u}, ..., u,} are linearly independent over D in 
the sense that }')"dju; = 0 holds only if every d; = 0, but {u, ..., us, Us + j} are linearly 
dependent over D for all j, 1 <j <r — s. Then the submodule >’,*Du; of M is free and hence the 
result holds if s = r. On the other hand, if s < r, then for each j there exists a nonzero d; e D 
such that dju, +; € }4°Du;. Then du, +; € )))"Du; for d= I] d; + 0 and so dM c }/,* Du;. Hence 
the D-endomorphism x ~ dx of M maps M into the free submodule ° Du; Since M is torsion 
free, this map is a monomorphism. Hence M is isomorphic to a submodule of the free module 


YF Du;. 


We shall obtain next a generalization of the theorem that any submodule of a finitely 
generated free module over a p.i.d. is free (Theorem 3.7 of BAI, p. 179). This is 


PROPOSITION 10.12. Let R be a ring, which is not necessarily commutative, with the 
property that every left ideal of R is a projective R-module. Then any submodule M of the 


free module R™ is a direct sum of m < n submodules isomorphic to left ideals of R. Hence M 
is projective. 


Proof. The result is clear if n = 1, since the submodules of R = R“ are left ideals. Hence 
assume that n > 1. Let S denote the submodule of R” of elements of the form (x), ..., x, — 1, 0). 


Evidently S = R” ~ 1], Now consider the R-homomorphism p : y = (Yp ..., Yp) >Y, of M into R. 


The image is a left ideal J of R and the kernel is isomorphic to a submodule N of R” ~ D. We 
have an exact sequence 


(22) 0-N5M451-0. 


Since J is projective, this exact sequence splits (p. 150); hence M = N © I. Since N c R” ~ D, 
we may use induction to conclude that N is isomorphic to a direct sum of < n — 1 left ideals. 
Then M is isomorphic to a direct sum m < n left ideals of R. Since any left ideal is projective, 


it follows that M is projective. 


We now assume that M is a finitely generated module over a Dedekind domain D. The 
defining property for D that we used is that any fractional ideal of D is invertible and hence is 
projective (Proposition 10.3). In particular, any integral ideal of D is projective and since 0 is 
trivially projective, every ideal in D is a projective module. Thus D satisfies the hypothesis of 
Proposition 10.12. Let T be the torsion submodule of M and put M = M/T. Then M is finitely 
generated and torsion free. Hence, by Proposition 10.11, M is isomorphic to a submodule of a 
free module of finite rank. Then, by Proposition 10.12, M is projective. Since we have an 
exact sequence 0 — T — M — M — 0 and M is projective, M = M @® T. Then T is a 
homomorphic image of Mand so it, too, is finitely generated. It is clear that we have reduced 
the problem of classifying finitely generated D-modules into isomorphism classes to the same 
problem for torsion-free modules and torsion modules. 

We consider first the torsion-free modules. We have seen that any such module is 
projective. On the other hand, if M is projective, M is a direct summand of a free module, and 
since D is a domain, free modules and hence projective modules over D are torsion-free. Thus 
M is finitely generated torsion free over D if and only if M is finitely generated projective. 

In determining conditions for isomorphism of two such modules, we may assume that the 
modules have the form M=/, ® ... ® Z, where J; is an integral ideal. Although there is no gain 
in generality in doing so, it is somewhat more natural to assume that the J; are fractional ideals. 


Consider the localization Mp* determined by the monoid D* of non-zero elements of D. We 
have Mo = Do @oM = lip @* Shr and since Dp* is the field F of fractions of D and J;p* = F, 
Mp* is an n-dimensional vector space over F. Thus Mp* = /”). The dimensionality n is the 
rank of M over D as defined in section 7.7 (Corollary, p. 415). The canonical map x ~> x/1 of 
M into Mp* is a monomorphism. It follows that we have an isomorphism of M with the D- 
submodule of F) consisting of the vectors (x, ..., x,,) where x; € I;. Hence we may assume 
that M consists of this set of vectors. 

If 7 is an isomorphism of M onto a second finitely generated torsion-free module M’, 7 has 
a unique extension to a bijective linear transformation of Mp*/F onto M'p*/F. It follows that M 
and M' have the same rank n and we may assume also that M’ is the D-submodule of F” 
consisting of the vectors (x’, ..., x'„) where x’; € J';, a fractional ideal in F, 1 <j < n. Taking 


into account the form of a bijective linear transformation of F™), we see that y has the form 


(23) (X55 «vce Mig “Pig evn am ia Xabi 


where A = (a;) € GL,(F), the group ofn x n invertible matrices with entries in F. We have x’, 
=>) xja; € I’, and if A l= (bjj), then x; =} x',b,; € L ifx', € I, 1 < k <n. Conversely, if A 
€ GL,(F) satisfies these conditions, then (23) is an isomorphism of M onto M'. 

Ifn = 1, the foregoing conditions imply that two fractional ideals / and /' are isomorphic if 
and only if there exists an a # 0 in F such that J’ = Ja. We can state this result in terms of the 
class group of the Dedekind domain D, which we define to be the factor group #/Y% where F 
is the multiplicative group of fractional ideals of D and Z is the subgroup consisting of the 
principal ideals aD, a + 0 in F. The ideal J and al = (aD)I are said to be in the same class. 
Thus we have shown that / and /' are isomorphic D-modules if and only if Z and 7' are in the 
same class. It is clear that we have a 1-1 correspondence between the isomorphism classes of 
fractional ideals and hence of torsion-free modules of rank one with the elements of the class 
group F /@. 

We prove next the 


LEMMA. Let J, and I, be fractional ideals. Then M =1, ® I, =D ® Ih. 


Proof. We use exercise 10 on p. 605. According to this, if a, #0 in/J, and J=a,l, \(c D), 
then there exists an a € J, such that a,J, ! + ah, ~ | = D. Hence we have b; € J, ! such that 
a,b, + ab, = 1. Then the matrix 


b; = (y 
(24) A= 
b» di 


ay s 
is invertible with 4 ! = (5: b, : If x, = L;, then yı = xb, + X2b7 e D and Y2 = — X12 + X24] 
€ lih. On the other hand, if y; € D and yy = cyc9, C; € Z, then x; = ayy, — bycycy € J} and x = 


ayı + byc\c> € L. Hence (x1, x2)> (x1, X2)4 is an isomorphism of J, ® J, onto D @ J, I. L 


We can now prove the main theorem on finitely generated modules without torsion over a 
Dedekind domain. 


THEOREM 10.14. Any finitely generated torsion-free module M over a Dedekind domain 
is isomorphic to a direct sum of a finite number of integral ideals. Necessary and sufficient 
conditions for the isomorphism of M=1I, ® ... ® I, and M'=I'; ® ... ® Tm where the I; and 


I', are fractional ideals are m =n and I, ... 1, and I', ... T' „are in the same class. 


Proof. The validity of the first statement has been noted before. Now suppose that M = M' 
where M and M' are as in the statement of the theorem. Then m = n and we have a matrix A = 


(aj) € GL,(F) such that if x; el,1<j<n, then x’; = a= xjajz € T’. Moreover, A= 
(b,) and x’, € Ly, 1 <k<n, iene = $ x'kby; € Jj. The first ie implies that x,a; € I’; 
for any x; € L. Thus La; CI';, so L'y Majų C D and aj, € I; p. Hence if (k, ..., k,) is a 
permutation of 1, 2, ..., n, then ay, ... ank» € (I I)! and (I1/',). This implies that det 4 e (ll 
Ly ‘AW, and (det ALY, c L. Similarly, (det A Ilr, c L. Hence Lr, = (det ALT, and 
SO II, and lIr, are in the same class. Conversely, assume that m = n and that | Ilj and lIr, are in 
the same class.By the lemma, M =; ®/,®...®n=D/,,0/,0...®/,=D@eD@6I, 1, 
el®...@1,=...2D®...eDelIl. Similarly, M' = DƏ ... © D @ l7. Since [| 7, and 


lIr, in the same class, these are isomorphic and hence M = M’. E 


We consider next the structure of a finitely generated torsion module M over D. Our 
program for studying M will be to replace it by a closely related module over a p.i.d. Let J = 
anny. IfM = Dx, + Dx, + ... + Dx,, then 


I = ()ann x; > []ann x; 4 0. 
Hence J = annpM £ 0. Let {P}, ..., P,} be the prime ideal divisors of Z and let 


r r 
S=D-|P, = (\(D-P). 

1 1 
Then S is a submonoid of the multiplicative monoid of D and we can form the ring D, and the 
D,-module M, = D, ® pM. Any prime ideal of D, has the form P, where P is a prime ideal 
such that P N S= Ø (p. 401). Then P c U P; and hence P c P; for some i (Proposition 7.1, p. 
390), which implies P = P;. Thus D, is a Dedekind domain with only a finite number of prime 
ideals, P'; = P;,, 1 <i <r. Such a domain is a p.i.d. (exercise 15, p. 625). Moreover, M, is a 
torsion D- -module since 7, # 0 and Z, M, = 0. 


The fundamental theorem on finitely generated modules over a p.i.d. (BAI, p. 187) is 
applicable to the D,-module M,. According to this, M, = Dy; ® ... ® Dsy,, where annp,y1 D 


annp,Y2 D ... D aNNpsYm Moreover, annp,y; D 1, # 0 and annp y; is a proper ideal in D,. We 


shall be able to apply this result to determining the structure of M as D-module, since we can 
recover M from M, by using the following 


LEMMA. Let I be a non-zero ideal in D, P}, ..., P, the prime ideal divisors of I in D, S =") 
(D — P). Let N be a D-module such that IN = 0. Then N = (DID® pN, as D-modules. 


Proof. We have (D/N®pN, = (D/1)®@)p(D, 8 pN= (D/D® p DJ®@pN = (D/D, @ pN. It is 
readily seen that the elements s + J, s € S, are units in D/Z, which implies that (D/D, = D/I. 


Hence (D/D, ®@ pN =N, since IN = 0. E 
We can now prove the following structure theorem. 


THEOREM 10.15. Let M be a finitely generated torsion module over a Dedekind domain 
D. Then M is isomorphic to a direct sum of cyclic modules Dz, ® ... ®Dz,, such that ann z} 


> annz, >... Dann Z„ * 0. Moreover, the ideals ann z; 1 < i < m, are uniquely determined. 


Proof. Let S be as before. Then M, = Dy ® ... ® DYm Where annp yı D ... D annpsy,,. 
Then if /'; = ANN sy,» Dy; = D,/I';. Let I; = {a € Dia/s e T; for some s e S}. Then Z, = T; (p. 
401) and D/J; is a cyclic D-module, Dz;, such that (D/L); = D,/Ij,; = D,/I’;. Moreover, annpz; = 
I, and I, > ... DJ,,. Now consider the module N = Dz, ® ... ® Dz,,. We have N, = (Dz)), ® ... 
® (Dz,,), = Dy, ® ... ƏDYm = M,. Hence by the lemma, M = N = Dz, ® ... ®Dz,,. The 
uniqueness follows also by using S-localization and the corresponding uniqueness result in the 


p.i.d. case (BAI, p. 192). We leave the details to the reader. 


Theorems 10.14 and 10.15 and the fact that any finitely generated module over a Dedekind 
domain D is a direct sum of its torsion submodule and a torsion-free module constitute a direct 
generalization of the fundamental theorem on finitely generated modules over a p.i.d. If Dis a 
p.i.d., then the class group #/% = 1 and the results show that M is a direct sum of copies of D 
and cyclic modules Dz}, ..., Dz,, with non-zero annihilators J,, ..., /,, such that J; > ... D Im 
This is the fundamental theorem as given in BAI, p. 187. The Dedekind domain is a p.i.d. if 
and only if #/Y = 1. The group F/Z gives a measure of the departure of D from being a p.i.d. 
It is a classical group of algebraic number theory. There it is shown by transcendental methods 
that the class group is finite. On the other hand, it has been shown by L. Claborn (Pacific J. of 
Math., vol. 18 (1966), 219-222) that any abelian group is the class group of a suitable 
Dedekind domain. 

We shall now show that the class group coincides with the projective class group, which 
we defined in section 7.4. The latter, denoted as Pic D, is the set of isomorphism classes of 
faithful finitely generated projective modules of rank one with multiplication defined by the 
tensor product. Since D is a domain, projective D-modules are automatically faithful. Hence 
the elements of Pic D can be represented by the fractional ideals, and as we have seen, two 
such ideals are isomorphic if and only if they are in the same class. Hence we have the 
bijective map [/] ~ Z of Pic D onto #/% where [/] denotes the isomorphism class of the 
fractional ideal 7. Now we have the canonical module homomorphism of J, ® J, into 4h 


sending a, ® a, into a, a>. Localization at all of the maximal ideals show that this is an 
isomorphism. Thus /, ® /, = /,J, and hence [/] + /% is a group isomorphism of Pic D with the 
class group. 
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11 


Formally Real Fields 


In Chapter 5 of Basic Algebra I we studied polynomial equations, inequations, and inequalities 
in a real closed field. We defined such a field to be an ordered field in which every odd degree 
polynomial in one indeterminate has a root and every positive element has a square root. We 


proved that if R is real closed, then ®'v~" is algebraically closed. The main problem we 
considered was that of developing an algorithm for testing the solvability of a system of 
polynomial equations, inequations, and inequalities in several unknowns ina real closed field. 
In the case of a single polynomial equation in one unknown, the classical method of J. C. F. 
Sturm provides a solution to this problem. We gave this method and developed a far-reaching 
extension of the method to the general case of systems in several unknowns. A consequence of 
this (also indicated in BAI) is Tarski’s theorem, which states roughly that a system of 
polynomial equations, inequations, and inequalities that has a solution in one real closed field 
has a solution in every such field (see BAI, p. 340, for the precise statement). 

We now resume the study of real closed fields, but we approach these from a different point 
of view, that of formally real fields as defined by Artin and Schreier. The defining property for 
such a field is that — 1 is not a sum of squares in the field, or equivalently, if 2 = 0 for a; in 
the field, then every a; = 0. It is clear that any ordered field is formally real. On the other hand, 


as we Shall see, every formally real field can be ordered. Hence a field is formally real if and 
only if it can be ordered. 

The Artin-Schreier theory of formally real fields is an essential element in Artin’s solution 
of one of the problems posed by Hilbert at the 1900 Paris International Congress of 
Mathematicians. This was Hilbert’s seventeenth problem, which concerned positive semi- 
definite rational functions: Suppose /(x),...,x,,) is a rational expression in indeterminates x; 
with real coefficients such that f(a,,...,a,,) = 0 for all (aj, ...,a,) where fis defined. Then is f 
necessarily a sum of squares of rational expressions with real coefficients? Artin gave an 
affirmative answer to this question in 1927. A new method of proving this result based on 
model theory was developed by A. Robinson in 1955. We shall give an account of Artin’s 


theorem. Our approach is essentially model theoretic and is based on the theorem of Tarski 
mentioned earlier. 

Artin’s theorem gives no information on the number of squares needed to express a given 
f(x},..-, X,). Hilbert had shown in 1893 that any positive semi- definite rational function in two 
variables is expressible as a sum of four squares. In 1966 in an unpublished paper, J. Ax 
showed that any positive semi-definite function in three variables is a sum of eight squares and 
he conjectured that for such functions ofn variables, 2” squares are adequate. This was proved 
in 1967 by A. Pfister by a completely novel and ingenious method. We shall give his proof. 

We shall conclude this chapter with a beautiful characterization of real closed fields due to 
Artin and Schreier. It is noteworthy that the proof of this theorem initiated the study of cyclic 
fields of p° dimensions over fields of characteristic p. 


11.1 FORMALLY REAL FIELDS 


We recall that a field F is said to be ordered if there is given a subset P(the set of positive 
elements) in F such that P is closed under addition and multiplication and F is the disjoint 
union of P, {0}, and — P = {— pp € P} (see BAI, p. 307). Then F is totally ordered if we 
define a > b to mean a — be P. Moreover if a> b, then a + c > b + c for every c and ap > bp 


for every p € P. Ifa + 0, then a? = ( — a)? > 0. Hence if 2“ = 0 in F, then every a; = 0. This is 
equivalent to: —1 is not a sum of squares in F. It follows that the characteristic of F is 0. 
Following Artin and Schreier we now introduce the following 


DEFINITION 11.1. Afield F is called formally real if— 1 is not a sum of squares in F. 


It is clear that any field that can be ordered is formally real. The converse of this 
observation is a theorem of Artin and Schreier. We shall now prove this by an argument due to 
Serre that is based on the following 


LEMMA 1. Let P, be a subgroup of the multiplicative group F* of a field F such that Po is 


closed under addition and contains all non-zero squares. Suppose that a is an element of F* 
such that —a £ Po. Then P; = Po + Poa = {b + caļb, c € Po} is a subgroup of F* closed under 
addition. 


Proof. Evidently P, is closed under addition and if b; c; € P, for i = 1, 2, then (b; + Cya)(b, 
+ ca) = (6,55 F Gica) F (bic F bc;)a E P, since bb, + Gcd and bic» + boc E€ Po. We 
note next that P} does not contain 0, since otherwise we have b + ca = 0 for b, c e Po, which 


gives —a = bc”! € Po, contrary to the hypothesis on a. Also we have 


(b+ca)' =(b+ca)(b+ca) * = b(b+ca)*+c(b+ca) 7aeP, 


since b(b + ca) * and c(b + ca)? € Po. Hence P} is a subgroup of F*. O 


Now let F be formally real and let Pp be the set of sums Xai? with every a; # 0. Evidently 
Po is closed under addition and since (Yia;*)(Yjb,;7) = Di,j(a;b;)*, Pois closed under addition 
and multiplication is closed under multiplication. Moreover, Po contains all of the non-zero 
squares and hence ifa = Ya, , a; #0, then a '=aa 7 € Py Thus Py satisfies the conditions of 


the lemma and so the set of subsets P' satisfying these conditions is not vacuous. We can apply 
Zorn’s lemma to conclude that this set of subsets of F contains a maximal element P. Then it 
follows from the lemma that if a is any element of F*, then either a or — a e P. Hence F = P U 
{0} U—P where — P ={ — pp e P} and since 0 £ P and P is closed under addition, P N — P = 
Ø and Og —P. Thus P, {0}, — P are disjoint and since P is closed under addition and 
multiplication, P gives an ordering of the field F. We therefore have the following 


THEOREM 11.1. Afield F can be ordered (by a subset P) if and only if it is formally real. 


In BAI (p. 308) we defined a field R to be real closed if it is ordered and if (1) every 
positive element of R has a square root in R and (2) every polynomial of odd degree in one 
indeterminate with coefficients in R has a root in R. We showed that the ordering in a real 
closed field R is unique and that any automorphism of such a field is an order automorphism 
(Theorem 5.1, p. 308). Moreover, we proved an extension of the so-called fundamental 


theorem of algebra: If R is real closed, then ®'v~") is algebraically closed (Theorem 5.2, p. 
309). We shall now give a characterization of real closed fields in terms of formal reality: 


THEOREM 11.2. Afield R is real closed if and only if R is formally real and no proper 
algebraic extension of R is formally real 


We separate off from the proof the following 


LEMMA 2. /fF is formally real, then any extension field F(r) is formally real if either r = 
V4 for a> 0 in F orr is algebraic over F with minimum polynomial of odd degree. 


Proof. First, letr = V® a> 0. Suppose that F(r) is not formally real. Then r ¢ F and we have 
a; b; e F such that — 1 = Y(a; + br}. This gives Ya? + Yb7 a=- 1. Since a > 0, this is 
impossible. 

In the second case let f(x) be the minimum polynomial of r. Then the degree of f(x) is odd. 
We shall use induction on m = deg f(x). Suppose that F(r) is not formally real. Then we have 


polynomials g(x) of degree < m such that Ye(r)* = — 1. Hence we have Yg(x)? = — 1 + 
f(x)g(x) where g(x) e F[x]. Since F is formally real and the leading coefficient of g(x)? is a 
square, it follows that deg(— 1 + f(x)g(x)) = deg(dg{x)*) is even and < 2m. It follows that deg 
g(x) is odd and < m. Now g(x) has an irreducible factor h(x) of odd degree. Let s be a root of 


h(x) and consider F(s). By the induction hypothesis, this is formally real. On the other hand, 
substitution of s in the relation Xg (x) =— 1 + f(x)g(x) gives the contradiction Yg(s)? =- 1. O 


We can now give the 


Proof of Theorem 11.2. Suppose that R is real closed. Then C= ®'v ~") is algebraically closed 
and C ? R. Evidently C is an algebraic closure of Rand so any algebraic extension of R can be 
regarded as a subfield of C/R. Hence if it is a proper extension it must be C, which is not 
formally real since it contains Y^". 

Next suppose that R is formally real and no proper algebraic extension of R has this 
property. Let a € R be positive. Then Lemma 2 shows that R( V) is formally real. Hence R(v ° 


) = Rand Ve R. Next let f(x) be a polynomial of odd degree with coefficients in R. Let g(x) 
be an irreducible factor of f(x) of odd degree and consider an extension field R(r) where g(r) 
= 0. By Lemma 2, R(r) is formally real. Hence R(r) = R. Then r e Rand f(r) = 0. We have 
therefore verified the two defining properties of a real closed field. Hence R is real closed. O 


_We shall show next that the basic property of a real closed field R that VP: ¢ R and C= R( 
vB ') is algebraically closed characterizes these fields. 


THEOREM 11.3. A field R is real closed if and only if YP é R and C = R(W4) is 
algebraically closed. 


Proof. It suffices to show that if R has the stated properties, then R is real closed. Suppose 
that R satisfies the conditions. We show first that the sum of two squares in R is a square. Let a, 
b be non-zero elements of R and let ube an element of C such that u? = a + b;, i = V Pi We have 
the automorphism x + iy ~ x — iy, x, y € R, of C/R whose set of fixed points is R. Now 


a’ +b? = (a+bi)(a—bi) = wy = (ui)? 


and wae R. Thus a? + b? is a square in R. By induction, every sum of squares in R is a square. 

Since — | is not a square in R, it is not a sum of squares and hence R is formally real. On the 

other hand, since C is algebraically closed, the first part of the proof of Theorem 11.2 shows 

that no proper algebraic extension of R is real closed. Hence R is real closed by Theorem 11.2. 
o 


EXERCISES 


1. Show that if F is formally real and the x; are indeterminates, then F(x),..., x„) is formally 
real. 


2. Define an ordering for a domain D as for a field: a subset P of D such that P is closed 
under addition and multiplication and D is the disjoint union of P, {0}, and — P. Show 
that an ordering in a domain has a unique extension to its field of fractions. 

3. Let F be an ordered field. Show that F[x], x an indeterminate, has an ordering defined by 
ap tan 1+... >0ifag>0. 

4. Call an ordered field F archimedean-ordered if for any a > 0 in F there exists a positive 


integer n such that n( = n 1)> a. Show that the field F(x), x an indeterminate, ordered by 
using exercises 2 and 3 is not archimedean. 


5. Prove that any archimedean-ordered field is order isomorphic to a subfield of R. 


6. (T. Springer.) Let Q be an anistropic quadratic form on a finite dimensional vector space 
V over a field of characteristic 4 2. Let E be an odd dimensional extension field of F and 
let Qç be the extension of Q to a quadratic form on Vp. Show that Qp is anisotropic. 


(Hint: It suffices to assume that Æ = F(p). Prove the result by induction on [E: F] using 
an argument like that in the second part of the proof of Lemma 2.) 


11.2 REAL CLOSURES 


DEFINITION 11.2. Let F be an ordered field. An extension field R of F is called a real 
closure of F if (1) R is real closed and algebraic over F and (2) the (unique) order in R is an 
extension of the given order in F. 


A central result in the Artin-Schreier theory of formally real fields is the existence and 
uniqueness of a real closure for any ordered field F. For the proof of this result we shall make 
use of Sturm’s theorem (BAI, p. 312), which permits us to determine the number of roots in a 
real closed field R of a polynomial f(x) € R[x]. Let f(x) =x" +ax” 71+... +a, and M=1+ 
la,|+ ... + |a,| where | | is defined as usual. Then every root of f(x) in R lies in the interval — M 
<x < M (BAI, exercise 4, p. 311). Define the standard sequence for f(x) by 


f= flx),  fi(x)= f(x), _ the derivative of f(x), 
fi-100 = AWA) -Si+ (x) — with deg fi, < deg f, 


for i > 1. Then we have an s such that f; +; = 0, and by Sturm’s theorem, the number of roots of 
f(x) in Ris V_ jy —Vjy where V, is the number of variations in sign in fola) fi(a), ..., f(a). We 
can use this to prove the 


LEMMA. Let R, i = 1, 2, be a real closed field, F; a subfield of R, a ~ @ an order 
isomorphism of F; onto F, where the order in F; is that induced from R, Suppose that f(x) is 


a monic polynomial in F [x], [œ the corresponding polynomial in F>[x]. Then f(x) has the 
same number of roots in R; as J) has in Ro. 


Proof. If M is as above, then the first number is V_ yy — Vj, and the second is V-a- Va 
determined by the standard sequence for /(). Since a ~ a is an order isomorphism of F} onto 
F, it is clear that these two numbers are the same. 0 


We can now prove the important 


THEOREM 11.4. Any ordered field has a real closure. If F; and F are ordered fields with 
real closures R; and R, respectively, then any order isomorphism of F, onto F, has a unique 
extension to an isomorphism of R; onto R, and this extension preserves order. 


Proof. Let F be an ordered field, f an algebraic closure of F, and let E be the subfield of / 
obtained by adjoining to F the square roots of all the positive elements of F. Then E is 
formally real. Otherwise, E contains elements a; such that $a? =— 1. The a; are contained ina 


subfield generated over F by a finite number of square roots of positive elements of F. Using 
induction on the first part of Lemma 2 of section 11.1 we see that this subfield is formally real 


contrary to Ya, = — 1 with a; in the subfield. We now consider the set of formally real 
subfields of / containing E. This set is inductive, so by Zorn’s lemma, we have a maximal 
subfield R in the set. We claim that R is real closed. If not, there exists a proper algebraic 
extension R’ of R that is formally real (Theorem 11.2). Since J is an algebraic closure of R, we 
may assume that R'c J so we have J > R'? R. This contradicts the maximality of R. Hence R 
is real closed. Now suppose a € F and a> 0. Then a = b° for b € E c Rand hence a > 0 in the 
order defined in R. Thus the order in R is an extension of that of F and hence R is a real closure 
of F. 

Now let F; and F, be ordered fields, R; a real closure of F;, and let o: a + a@ be an order 
isomorphism of F} onto F,. We wish to show that o can be extended to an isomorphism } of R, 
onto R». The definition of} is easy. Let r be an element of R4, g(x) the minimum polynomial of 
r over F and let r} < r3 < ... <7 ZEF <... < Fm be the roots of g(x) in R; arranged in 
increasing order. By the lemma, the polynomial #*) has precisely m roots in R, and we can 
arrange these in increasing order as R, < R, < ... < R,,. We now define >) as the map sending r 


into the kth one of these roots. It is easy to see that E is bijective and it is clear that } is an 
extension of o. However, it is a bit tricky to show that >’ is an isomorphism. To see this we 
show that if S is any finite subset of R,, there exists a subfield E; of R,/F, and a 


monomorphism 7 of E/F; into R,/F, that extends o and preserves the order of the elements of 
S, that is, if S = {9, < s2 < ... < s„}, then 9s, <ns)<... ys. Let T= SUtV8+1-Sdl Si <n- ang 
let E, = F(T). Evidently, E, is finite dimensional over F} and so £} = F\(w). Let f(x) be the 
minimum polynomial of w over F}. By the lemma, /(*) has a root in R», and we have a 


monomorphism nn of £,/F, into R/F, sending w into w.Now 


Misis 1)= ns) = m(si+1—S) = MV sis 1 =30°) = (/ 8x18" > 0. Hence y preserves the order of the s; Now let 
r and s be any two elements of E; and apply the result just proved to the finite set S consisting 


of the roots of the minimum polynomials of r, s, 7 + s, and rs. Since 7 preserves the order of 
the elements of S, n0) = (r), n(s) = X(s), nr +s) =} (r + s), and (rs) =} (rs). Hence } (r + 
s) = y(r + s) = n(r) + (s) =} (r) + }(s) and similarly } (rs) = }(r)} (s). Thus } is an 
isomorphism. 

It remains to show that } is unique and is order preserving. Hence let >’ be an isomorphism 
of R; onto R,. Since >’’ maps squares into squares and the subsets of positive elements of the R; 


are the sets of non-zero squares, it is clear that }” preserves order. Suppose also that >” is an 
extension of o. Then it is clear from the definition of >° that }' = >)’. This completes the proof. 
o 


If R, and R, are two real closures of an ordered field F, then the identity map on F can be 
extended in a unique manner to an order isomorphism of R, onto Rz. In this sense there is a 
unique real closure of F. 

It is easily seen that the field © of rational numbers has a unique ordering. Its real closure R 
o is called the field of real algebraic numbers. The field €» = Roly =) is the algebraic closure 
of ©. This is the field of algebraic numbers. 


EXERCISES 


1. Let F be an ordered field, E£ a real closed extension field whose order is an extension of 
the order of F. Show that E contains a real closure of F. 

2. Let F be an ordered field, E an extension field such that the only relations of the form 
a;b? = 0 with a; > 0 in F and b; € E are those in which every b; = 0. Show that E can 
be ordered so that its ordering is an extension of that of F. 


11.3 TOTALLY POSITIVE ELEMENTS 


An interesting question concerning fields is: what elements of a given field can be written as 
sums of squares? We consider this question in this section and in the next two sections; we first 
obtain a general criterion based on the following definition. 


DEFINITION 11.3. An element a of a field F is called totally positive if a > 0 in every 
ordering of F. 


It is understood that if F has no ordering, then every element of F is totally positive. Hence 
this is the case if F is not formally real. If F is not formally real, then — 1 = Ya? for a; € F, and 


if char F £ 2, then the relation 
l-a  /1+a\ ,(1-a\ 
— fee SED ra EA: 


2 
(2) i= (55) 


shows that every element of F is a sum of squares. We shall need this remark in the proof of the 
following criterion. 


THEOREM 11.5. Let F be a field of characteristic # 2. Then an element a # 0 in F is 
totally positive if and only if a is a sum of squares. 


Proof. If0 + a= a;, then evidently a > 0 in every ordering of F. Conversely, assume that a 
+ 0 is not a sum of squares in F. Let J be an algebraic closure of F and consider the set of 
subfields E of J/F in which a is not a sum of squares. By Zorn’s lemma, there is a maximal 
one; call it R. By the preceding remark, R is formally real and hence R can be ordered. We 
claim that — a is a square in R. Otherwise, the subfield RIV ~ of //F properly contains R, so a 
is a sum of squares in ®(\V~4, Hence we have b,C,; € R such that a = Li tev-4", This gives 
yb,c; = 0 and a =b} — a Yc? Hence a(1 + Yc?) = 7 and 1 + Yc? £0, since R is formally 
real. Thenifc=1+Yc7, 


a=} bic! =} b? (1 +) a)? 


so a is a sum of squares in R, contrary to the definition of R. Thus — a = b? for a b € R and 


hence a = —b? is negative in every ordering of R. These orderings give orderings of F and so 
we have orderings of F in which a < 0. Thus a is not totally positive. D 


We shall apply this result first to determine which elements of a number field F are sums of 
squares. We have F = ®(r) where r is algebraic over ©. If Rọ is the field of real algebraic 
numbers ( = the real closure of ®), then Cy =Ro(v ~') is an algebraic closure of ® and of F. If n 
= [F: ©], then we have n distinct monomorphisms of F/® into Cp/®. These are determined by 
the maps r ~ r; 1 < i <n, where the r; are the roots of the minimum polynomial g(x) of r over 
©. Let rj, ..., rp be the r; contained in Rọ. We call these the real conjugates of r and we agree 
to put h = 0 ifnor; € Ro. Let o; 1 <i < h, be the monomorphism of F/® such that or = r;. Each 
o; defines an ordering of F by declaring that a > 0 in F if o,a > 0 in the unique ordering of Ro. 
We claim that these r orderings are distinct and that they are the only orderings of F. First, 
suppose that the orderings defined by o; and o; are the same. Then o,o; l is an order- 
preserving isomorphism of the subfield ©(7;) of Rọ onto the subfield ©(7;). Since Rọ is 


algebraic over ©(r;) and ©(7;), Ro is the real closure of these fields. Hence, by Theorem 11.4, 


l can be extended to an automorphism o of Rọ. Since Rọ is the real closure of ®, it 


0,0; 
Jol 

follows also from Theorem 11.4 that the only automorphism of Rọ is the identity. Hence o = 1 
and o; = oj. Next suppose that we have an ordering of F and let R be the real closure defined 


by this ordering. Since R is algebraic over ®, R is also a real closure of ©. Hence we have an 
order isomorphism of R/® onto R,/®. The restriction of this to F coincides with one of the o; 


and hence the given ordering coincides with the one defined by this o,. 
It is clear that the o; that we have defined can be described as the monomorphisms of F into 
R. Hence we have proved the following 


THEOREM 11.6. Let F be an algebraic number field, Rọ the field of real algebraic 


numbers. Then we have a 1 — 1 correspondence between the set of orderings of F and the set 
of monomorphisms of F into Rọ. The ordering determined by the monomorphism c; is that in 


whicha>0 forae fifoja>O0inR,. 


An immediate consequence of Theorems 11.5 and 11.6 is the following result due to Hilbert 
and E. Landau: 


THEOREM 11.7. Let F be an algebraic number field and let ©}, ...,0,(h = 0) be the 
different monomorphisms of F into the field Rọ of real algebraic numbers. Then an element 
a€ fis a sum of squares in F if and only if oa >O forl <i<h. 


EXERCISES 


1. Let F be an ordered field, E an extension field. Show that if b is an element of E that 
cannot be written in the form )a,b7 for a; > 0 in F and b e E, then there exists an 
ordering of E extending the ordering of F in which b < 0. 

2. Let F be an ordered field, R the real closure of F, and £ a finite dimensional extension of 
F. Prove the following generalization of Theorem 11.6: There is a 1 — 1 correspondence 
between the set of orderings of E extending the ordering of F and the set of 
monomorphisms of E/F into R/F. 

3. (I. Kaplansky-M. Kneser.) Let F be a field of characteristic + 2 that is not formally real. 
Suppose that |F*/F*?| = n < œ. Show that any non-degenerate quadratic form in n 
variables is universal. (Sketch of proof: Let a;,a>,... be a sequence of non-zero elements 
of F and let M, denote the set of values of the quadratic form a,x, + ... + azxg? for x; € 
F. Show that there exists a k <n such that M, , , = My. Then M} +1 =a, + F° + M; = M; 
where F? is the set of squares of elements of F. Iteration of M, = a; . F? + M, gives M, 


= ay, 4 \(F’ + ... + F?) + M}. Hence conclude that M, = F + M, and M, = F.) 


11.4 HILBERT’ S SEVENTEENTH PROBLEM 


One of the problems in his list of twenty-three unsolved problems that Hilbert proposed in an 
address before the 1900 Paris International Congress of Mathematicians was the problem on 
positive semi-definite rational functions: Let f be a rational function of n real variables with 
rational coefficients that is positive semi-definite in the sense that f(a), ... a,) = O for all real 
(a; ...,a,) for which fis defined. Then is f necessarily a sum of squares of rational functions 
with rational coefficients? By a rational function of n real variables with rational coefficients 
we mean a map of the form (ay, ... ap )~ f(a, ... a,)where f(x), ... Xp) = Q(X), ... X,)/ A(X), ... 
x,) and g and h are polynomials in the indeterminates x; with rational coefficients. The domain 
of definition of the function is a Zariski open subset defined by h(a, ... a,) # 0 and two 
rational functions are regarded as equal if they yield the same values for every point of a 
Zariski open subset. 

In 1927, making essential use of the Artin-Schreier theory of formally real fields, Artin 
gave an affirmative answer to Hilbert’s question by proving the following stronger result: 


THEOREM OF ARTIN. Let F be a subfield of 8 that has a unique ordering and let f be a 
rational function with coefficients in F such that f(a, ... a,) = 0 for all a; € F for which f is 


defined. Then f is a sum of squares of rational functions with coefficients in F. 


Examples of fields having a unique ordering are ©, R, and any number field that has only 
one real conjugate field. 

The condition that F is a subfield of R in Artin’s theorem can be replaced by the hypothesis 
that F is archimedean ordered. It is easily seen that this condition is equivalent to the 
assumption that F c R of Artin’s theorem. We shall prove a result that is somewhat stronger 
than Artin’s in that the archimedean property of F will be replaced by a condition of density in 
the real closure. If F is a subfield of an ordered field Z, then F is said to be dense in E if for 
any two elements a < b in E there exists a c € f such that a < c < b. It is easily seen that © is 
dense in this sense in R and this implies that any subfield of R is dense in R. Hence the 
following theorem is a generalization of Artin’s theorem. 


THEOREM 11.8. Let F be an ordered field such that (1) F has a unique ordering and (2) 
F is dense in its real closure. Let f be a rational function with coefficients in F such that 


f(aj5---54,) > 0 for all (a,...,4,) E€ F® for which f is defined. Thenf is a sum of squares of 
rational functions with coefficients in F. 


We shall give a model theoretic type of proof of Theorem 11.8 based on the following 


result, which was proved in BAI, p. 340. 

Let R, and R, be real closed fields having a common ordered subfield F, that is, the 
orderings on F induced by R, and R, are identical. Suppose that we have a finite set S of 
polynomial equations, inequations (f + 0), and inequalities (f> 0) with coefficients in F. Then 
S has a solution in R, if and only if it has a solution in R}. 

We now proceed to the 


Proof of Theorem 11.8. The set of rational functions of n variables with coefficients in F form 
a field with respect to the usual definitions of addition and multiplication. If p, 1 <i < n, 


denotes the function such that P(a, ...,a,,) = a;, then we have an isomorphism of the field F(x, 
...X,), X; indeterminates, onto the field of rational functions such that x; + p;. Accordingly, the 
latter field is F(p,....p,). Suppose that f e F(pj,...,p,) is not a sum of squares. Then by 
Theorem 11.5, there exists an ordering of F(p),...,p,,) Such that f< 0. Write f= gh” ' where g, h 
Ee F[pj,....p,]. Then gh < 0 so if k(x... Xp) = 8X15- -X AX]. Xp), then (p1,....p,,) < 0 and 
the inequality k(x),...,x,,) < 0 has the solution (p),...,p,,) in F(p... Pn) and a fortiori in the real 
closure R of F(p),....p,). Now consider the real closure Rọ of F. Since F has a unique 
ordering, the orderings of F in Rp and in R are identical. Moreover,hk(x),...,x,) € F[X1,.-..X,]- 
Hence by the result quoted, there exist a; € Rọ such that k(a),...,a,) < O and hence such that 
f(a,,...,a,) < 0. The proof will be completed by showing that the a; can be chosen in F. 


LEMMA. Let F be an ordered field that is dense in its real closure R and suppose that for 
K(X1,---X)) E R[X1,...,x,|there exist a; € R such that k(a,,...,a,,) < 0. Then there exist b; e F 


such that k(b,,...,b,,) < O. 


Proof. We use induction on n. If n = 1, let a' be chosen < a ( = a,) so that the interval [a’ , a] 


contains no root of k. Then k(x) < 0 for all x in [a’ a] (BAI, p. 310) and we may choose x = b € 
F in [a' , a]. Then k(b) < 0. Now assume the result for n — 1 variables. Then the one-variable 
case shows that there exists a b} € F such that k(b),a5,...,a,) < 0 and the n — 1-variable case 


implies that there exist b5,...,b,, such that k(b,,b5,...,5,) <0. D 


Remark. It has been shown by K. McKenna that the following converse of Theorem 11.8 
holds: If F is an ordered field such that any rational function f that is positive semi-definite in 
the sense that f(a),...,a,,) = 0 for all (a),...,a,) for which F is defined is a sum of squares, then 


F is uniquely ordered and is dense in its real closure. 
EXERCISES 


1. (J. Keisler.) Let F be ordered and let the extension field F(x), x transcendental, be 
ordered as in exercise 4, p. 634. Show that F(x) is not dense in its real closure by 


showing that there is no element of F(x) in the interval [V%2V*). 


2. Let R be real closed and let f(x) € R[x] satisfy f(a) = 0 for all a e R. Show that f(x) is a 
sum of two squares in R[x]. 


3. (C. Procesi.) Let R be a real closed field and let V be an irreducible algebraic variety 
defined over R, F the field of rational functions on V (see exercise 7, pp. 429-430). Let 
h,...,h, E€ F and let X be the set of points p of V such that hp) = 0. Show that if g € F 


satisfies g(p) > 0 for all p € X on which g is defined, then g has the form 
X$ „ihi th; 
1 n ii i J 


where >” indicates summation on the indices between 1 and k in strictly increasing order 


and the s; ...i; are sums of squares in F. 


(Sketch of proof. The conclusion is equivalent to the following: g is a sum of squares 
in F, = Fv"), There is no loss in generality in assuming that g and the h; are 
polynomials in the coordinate functions (p, ..., p,,)(as in the proof of Artin’s theorem), 
that is, we have polynomials g(x,.. Xn) Aj(Xp. .X,), 1 <j < k, in indeterminates x; with 
coefficients in R such that g = g(p,....P_), A; = 4(Pp----Pn)- If g is not a sum of squares in 
F; then there exists an ordering of F} such that g < 0. Let f,(x,...,%,),---Ap(Xp---X,) be 
generators of the prime ideal in R[x,...,x,,] defining V. Then we have /{(p,....p,) = 9, 
-<fn Pp: Pn) = 9, A(Pp-- Pp) = 9,---Ag(Pp-- Pn) = 0, Z(Pp----Pn) < 9 in F and hence 
ina real closure R, of F}. Consequently, we have (a,,...,a,,),a; E R, such that f;(a,...,a,) 
= 0,...f,(ap--d,) = O0A(a,....a,) 2 90,...,A,(a,...,.4,) Z 0,g(a,,....4,) < 0 This 
contradicts the hypothesis) 

4. (J. J. Sylvester.) Let f(x) € R[x], R real closed, x an indeterminate, and let A = 
R[x\/((x)). Let T(a, b) be the trace bilinear form on A/R and Q(a) = T(a, a) the 
corresponding quadratic form. Show that the number of distinct roots of f(x) in R is the 
signature of Q (BAI, p. 359). 

5. Let F be a field, f(x) a monic polynomial in F[x], and let f(x) =! &-” in a splitting field 
EIF of f(x). Put S; = $”; = 1 rj, so s; € F. Then the matrix 


So Sy Sn- 1 
Sy S3 Sn 
Sn-1 Sn S2n-2 


is called the Bezoutiant of f(x). Show that the determinant of the Bezoutiant is the 
discriminant of the trace form 7(a,b) on A = F[x]/(/(x)) determined by the base T, X,..., 
X”! where ṣi} = x + (f(x). 


6. (Sylvester.) Notations as in exercises 4 and 5. Let b, denote the sum of the k rowed 
principal (=diagonal) minors of the Bézoutiant of f(x) (hence, the characteristic 
polynomial of the Bézoutiant is x” — b,x"~!+... + (— 1)"b,,). Show that all the roots of 
f(x) are in R if and only if every b; > 0. 


7. (Procesi.) Let R be a real closed field and let g be a symmetric rational function of n 
variables with coefficients in R such that g(aj,...,a,) > 0 for all (aj,...,a,) € R” on 
which g is defined. Let b, denote the sum of the k-rowed minors of the Bézoutiant of f(x) 


= []i@—-p), Show that g has the form $? Sa ... i, bj, ... bp where >” indicates 
summation on the indices between 1 and & in strictly increasing order and the S, ... i, 


are sums of squares of symmetric functions. 


11.5 PFISTER THEORY OF QUADRATIC FORMS 


If an element of a field is a sum of squares, can we assert that it is a sum of n squares for a 
specified n? It is not difficult to see that if R is a real closed field, then any element of R(x) that 
is a sum of squares is a sum of two squares. Hilbert showed that if a rational function of two 
variables over R is positive semi-definite, then it is a sum of four squares. In the next section 
we Shall prove Pfister’s theorem that if R is real closed, any element of R(x,..., x,,) that is a 
sum of squares is a sum of 2” squares. We shall also sketch in the exercises of section 11.6 a 
proof of a theorem of Cassels that there exist elements in R(x),..., x,,) that are sums of squares 
but cannot be written as sums of fewer than n squares. The exact value of the number k(n) such 
that every element in R(x,,..., x,,) that is a sum of squares is a sum of k(n) squares is at present 
unknown. The results we have indicated give the inequalities n < k(n) < 2”. 

The proof of Pfister’s theorem is based on some results on quadratic forms that are of 
considerable independent interest. We devote this section to the exposition of these results. 

We deal exclusively with quadratic forms Q on finite dimensional vector spaces V over a 
field F of characteristic + 2. As usual, we write B(x, y) = OC x + y ) — Q(x) — QV). Since the 
characteristic is £ 2, it is preferable to replace B(x, y) by Q(x, y) = 4 B(x, y). Then Q(x, x) = 
Q(x). We shall now indicate that V has an orthogonal base (u),..., u,,) relative to Q such that 
O(u;) = b; by writing 


(3) Q ~ diag {bi, b2,..., Dn}. 
We shall also write 


(4) diag {b,,...,b,} ~ diag {cy,..., Cn} 


if the quadratic forms )”", bx? and >”, cx? are equivalent. 


We now introduce some concepts and results on quadratic forms that we have not 
considered before. First, we consider the tensor product of quadratic forms. Let V, i = 1, 2, be 


a vector space over F, Q; a quadratic form on V;. We shall show that there is a unique 
quadratic form Q; ® Q, on V; ® V, such that 


(5) (Q1 ® Q2)(v1 @ v2) = Qi(01)Q2(v2) 


for v; € V;. Let (uy,..., u„® ) be a base for V/F. Then (u;\) ® u,®) is a base for V, ® V, and 
we have a quadratic form Q on V; ® V, such that 


(6) Qu Q uf, uk! O uP) = Qi luf”, uk?) Qu, ul”). 
Ifv = Zau” and vy = Yb.) then 

Olvi ® v2) = Q(v; Q v2, v1 @ v2) 
QÙ aibu” Q uP, Y abu Q u”) 


2 2 
© ajay bjbiQ, (uh, ut”) Qalu, uy”) 
i,j,k,l 


by aia Qi (u$, ug) È bjbiQ2(u\”, uy’) 
= Q;(v1)Q2(v2). 


Putting O = QO, ® Q, we have (5) and since the vectors v; ® v, span V; ® V>, it is clear that Q; 
® Q, is unique. 


If (u;),...;u,) is an orthogonal base for V; then (uj) ® u,®) is an orthogonal base for V, 
® V, and if 


Q; ~ diag {b{’,..., bi}, 
then 
01 ® Os ~ diag {bP bP, ... DY BP; bY bP, ..., DY b2): 
(7) vin ae ee BPDP) 
= diag {b{P,...,b®} @ diag {b®,..., b2} 


where the tensor product of matrices is as defined on p. 250. 
In a similar manner, we can define the tensor product of more than two quadratic forms and 


we have the following generalization of (7): If Q; ~ diag {b,,...,b,,,), then 


(8) 0:0: QQ, ~ diag {by!,..., bY} OO diag {bY’,..., b0. 


It is also convenient to define the direct sum of two quadratic forms. If Q, i = 1, 2, is a 
quadratic form on V; then Q, ® Q; is defined to be the quadratic form on V, ® V, such that 


(Q1 ®Q2)(v1, V2) = Q1 (v1) + Qo(v2). 


It is clear that QO, ® Q, is well-defined and if V} and V, are identified in the usual way with 
subspaces of V, ® V>, then these subspaces are orthogonal relative to the bilinear form of QO; ® 
Qz. If (u,,...,u,,) is an orthogonal base for V;, then (u;,...,u,,;),u,,...,u,.) is an 
orthogonal base for V4 ® V>. We have 


O) 01 ®Q ~ diag {b,..., B, 9P..., B2) 


B 0 
if Q; ~ diag {b,,..., b,}. If B and C are matrices, it is convenient to denote the matrix (0 c) 
by B ® C. Using this notation we can rewrite 


. l 1)) 2 2 
Q: @Q) ~ diag {b\’,..., b6} @diag {bY??,..., bP. 


We now consider Pfister’s results on quadratic forms that yield the theorem on sums of 
squares stated at the beginning of the section. Our starting point is a weak generalization of A. 
Hurwitz’s theorem on sums of squares (see BAI, pp. 438—451). Hurwitz proved that there exist 
identities of the form 21° "219 = Lt=" where the z; depend bilinearly on the x’s and the y’s if 
and only ifn = 1, 2,4 or 8. Pfister has shown that for any n that is a power of two, the product 
of any two sums of squares in a field F is a sum of squares in F. Thus at the expense of 
dropping the requirement that the z; depend bilinearly on the x’s and y’s, we have 


(Li WLI) = ETE? that is a power of two. More generally, we consider quadratic forms Q that 
are multiplicative in the sense that given any two vectors x and y there exists a vector z such 
that O(x)O(v) = Q(z). A stronger condition on Q is given in 


DEFINITION 11.4. A quadratic form Q is said to be strongly multiplicative if Q is 
equivalent to cQ for any c £ 0 represented by Q. 


This means that there exists a bijective linear transformation 7 of V such that cO(x) = O(7x) 
for all x. Then if Q) = c, O(x)Q(y) = Q(z) for z = nx; hence Q strongly multiplicative implies 
O multiplicative. If O is of maximal Witt index (BAI, p. 369) on an even dimensional space, 
then O can be given coordinate-wise as 2**°—L#*:*/ it is easily seent that Q is strongly 
multiplicative. On the other hand, the quadratic form Q = x? — x22 — x;* on a three-dimensional 


vector space over R is multiplicative but not strongly multiplicative. The fact that it is 
multiplicative follows from the observation that any form that is universal has this property. On 
the other hand, Q is not equivalent to — Q by Sylvester’s theorem (BAI, p. 359). For any 
quadratic form Q on a vector space V/F we let Fg denote the set of non-zero elements of F 


represented by Q. It is clear that F*Q is closed under multiplication if and only if Q is 
multiplicative. Since Q(x) = c £0 implies O(c 1x) = c !, it is clear that O is multiplicative if 
and only if F*ọ is a subgroup of F*. 

We proceed to derive Pfister’s results. We give first a criterion for equivalence in the 
binary case. 


LEMMA 1. diag {b, b>} ~ diag {c, c2} if and only if c, is represented by bix? + bx and 
bby and c;c, differ by a square factor. 


Proof. Since b,b, is a discriminant of Q = b,x? + bx% and the b; are represented by Q, it is 
clear that the conditions are necessary. Now suppose they hold. By the first condition we have 
a vector y such that O(y) = c}. Then diag {b}, by} ~ diag {c}, c} and cc =k°b b>, k e F*. Also 
C,C, = Êb b». Hence c, = n?c and we can replace c by cz. Thus diag {b,),! ~ diag {c}, co}. O 


We prove next the key lemma: 


LEMMA 2. Let Q be a strongly multiplicative quadratic form, a an element of F*. Let Q, ~ 
diag {1, a}. Then Q, ® Q is strongly multiplicative. 


Proof. Itis clear that Q, ® Q is equivalent to Q ® aQ. Hence, it suffices to show that the 
latter is strongly multiplicative. We now use the notation ~ also for equivalence of quadratic 
forms and if Q} ~ diag {a, b}, then we denote QO, ® Q, by diag {a, b] ® Q, ~ aQ, ® bQ,. Let k 
be an element of F* represented by Q ® aQ, so k = b + ac where b and c are represented by Q 
(possibly trivially if b or c is 0). We distinguish three cases: 

Case I. c = 0. Then k = b and Q ~ bQ. Hence Q ® aQ ~ bO ® abO = b(O ® aQ) = k(O ® 
aQ). 

Case Il. b= 0. Then k = ac and k(O ® aQ) = kQ ® kaQ = acQ ® a°cỌ ~ aQ ® QO since cQ 
~ Ọ by hypothesis and Q ~ a’Q for any a e F*. Thus k(O ® aQ)~ O ® aQ. 

Case Ill. bc + 0. We have 2842 ~ /2@acQ ~ diag {b, ac} 8. Since k = b + ac is represented by 
bx)? + acx,* and bac and k°abc differ by a square, it follows from Lemma | that diag {b, ac} ~ 
diag {k, kabc}. Hence diag {b, ac] ® Q ~ diag {k, kabc} ® O~k diag {1, abc} ® Q ~ kQ ® 
kabcO~kQ ® kaQ=k(Q ® aQ). 


In all cases we have that 222 ~ Qa), so Q ® aQ is strongly multiplicative. D 


It is clear that the quadratic form Qg = x? ~ diag {1} is strongly multiplicative. Hence 
iterated application of Lemma 2 gives 


THEOREM 11.9. Ifthe a;€ F*, then 


(10) Q ~ diag {1, a;} ® ++: @diag {1, an} 


is a strongly multiplicative quadratic form. In particular, E? xè ~ diag {1,1} © + @diag {1,1 (n 
factors) is strongly multiplicative. 


We shall call the forms given in Theorem 11.9 Pfister forms of dimension 2”. We prove 
next a type of factorization theorem for such forms. 


THEOREM 11.10. Write 
(11) diag {1, a;} Q Q diag {1, an} = diag {1} @D 


and let Q be a quadratic form such that Q ~ D. Suppose that b, + 0 is represented by Q. 
Then there exist by,..., b, E€ F* such that 


(12) diag {1,a;} ® ++: ® diag {1,a,' ~ diag {1,b,} ® ++: @ diag {1, bn}. 


Proof. By induction on n. If n = 1, then b, = a,c? and hence diag{l, a,} ~ diag {1, b,;}. Now 
assume the result for n and consider 


(13) diag {1,a;} ® = @ diag {1, a} ® diag {1, a} = diag {1} @ D'. 


Suppose that b, # 0 is represented by Q' such that Q' ~ D' . Then b; = b’, + ab where b' , is 
represented by Q ~ D and b is represented by x? ® Ọ ~ diag {1, a;} ® ... @ diag {1, a,}. 


Case I. b = 0. Then b'# 0 and the induction hypothesis gives elements b,...,b, E€ F* such 
that (12) holds. Then 


diag {1, a1} Q ++: @ diag {1, an} ® diag {1, a} 
~ diag {1,b;} Q ++: @ diag {1, ba} ® diag {1, a}. 
Case Il. b' ; = 0. Then b £ 0, b; = ab, and 


diag {1,a,;} ++: ® diag {1, a} Q diag {1, a} 
~ (diag {1,a,} ® + ® diag {1, a,}) @ a(diag {1, a,} ® +++ ® diag {1, a,}) 
~ (diag {1,a,} Q= ® diag {1, a,}) ® ab(diag {1, a,} 
®-+@diag{1,a,') | (Theorem 11.9) 
~ diag {1, ab} ® diag {1,a,} ® = Q diag {1, an}. 


Case Ill. bb' , # 0. The equivalence established in Case II permits us to replace a by ab. 
Then by Case I we have by,...,b,, € F* such that 


diag {1,a,;} ® ++: @ diag {1,a,} Q diag {1, ab} 
~ diag {1, bi} ® diag {1, b2} ® ++: ® diag {1, bn} ® diag {1, ab}. 
Now 
diag {1, ab} ® diag {1, b1} ~ diag {1, ab, bi, abbi } 
~ diag {1, abb; } @diag {b}, ab} 
~ diag {1, abb; @diag{b;,b,abb;} (Lemma 1) 
~ diag {1, bı} Q diag {1, bi ab}. 
Substituting this in the first equivalence displayed, we obtain the result in this case. 0 


We are now ready to derive the main result, which concerns the representation of sums of 
squares by values of Pfister forms. 


THEOREM 11.11. Suppose that every Pfister form of dimension 2" represents every non- 
zero sum of two squares in F Then every Pfister form of dimension 2” represents every non- 
zero sum of k squares in F for arbitrary k. 


Proof. By induction on k. Since any Pfister form represents 1, the case k = 1 is clear and the 
case k = 2 is our hypothesis. Now assume the result for k > 2. It suffices to show that if Q is a 
Pfister form of dimension 2” and a is a sum of k squares such that c = 1 + a # 0, then c is 
represented by Q. This will follow if we can show that O ® — cQ represents 0 non-trivially. 
For then we shall have vectors u and v such that Q(u) = cO(v) where either u + 0 or v 4 O. If 
either O(u) = 0 or O(v) = 0, then both are 0 and so Q represents 0 nontrivially. Then Q is 
universal and hence represents c. If Q(u) # 0 and O(v) + 0, then these are contained in F*g and 


hence c = O(u)O(v) | € F *o, SO c is represented by Q. We now write Q = x? © Q'. Since Q 
represents a, we have a = a? + a’ where ỌQ' represents a’ . We have diag 
(1, -} @Q~ 9@(-eQ) ~x7 @Q'@(—cQ) and POl- represents “7 l+a+a)= -U+a:) TF this is 0, then 
c = a' is represented by Q. Hence we may assume that 1 + a}? # 0. Then by Theorem 11.10, 
diag {1, — c} ® QO ~ diag {1, — 1 — af} ® Q" where O"is a Pfister form of dimension 2”. By 
the hypothesis, this represents 1 + a,7. It follows that diag {1, — 1 — a,*} ® Q" represents 0 non- 
trivially. Then Q ® — cQ ~ diag {1, — 1 —a,? } ® Q" represents 0 non-trivially. This completes 
the proof. O 


11.6 SUMS OF SQUARES IN R(,...,x,,), R A REAL CLOSED FIELD 


We now consider the field R(x,...,x,,) of rational expressions in n indeterminates x1,...,x,, over 


a real closed field R. We wish to show that Theorem 11.11 can be applied to the field F = 
R(x},...,X,) For this purpose we need to invoke a theorem that was proved by C. C. Tsen in 


1936 and was rediscovered by S. Lang in 1951. To state this we require the following 


DEFINITION 11.5. A field F is called a C,-field if for any positive integer d, any 


homogeneous polynomial f with coefficients in F of degree d in more than d’ indeterminates 
has a non-trivial zero in F®. 


By a non-trivial zero we mean an element (a;,...,ag) € F” such that (a;,.. ag) # (0, ..., 0) 
and f{a1,...,ag) = 0. The theorem we shall require is the 


THEOREM OF TSEN-LANG. [fF is algebraically closed and the x's are indeterminates, 
then F(x1,...,x,)is a C,-field. 


It is readily seen that any algebraically closed field is a Cg field. Now suppose that F is not 


algebraically closed. Then there exists an extension field E/F' such that [E:F] = n > 1 and n is 
finite. Let (ul, ..., u,) be a base for E/F and let p be the regular matrix representation 
determined by this base (BAI, p. 424). Then if “= Lam acF, Nee(u) = det (X ao(u)) #0 unless every 
a; = 0. Let x), ..., x, be indeterminates and put M(x, ..., x,) = det Ox p(u,)). This is a 
homogeneous polynomial of degree n and Ma, ..., a,) = Ngp(U)). Hence N has no zero 
except the trivial one (0, ..., 0) and hence F is not a Cọ-field. Thus a field is a Cọ-field if and 
only if it is algebraically closed. 

C|-fields were introduced by Artin, who called these fields guasi-algebraically closed. 
We have indicated in a series of exercises in BAI that any finite field is a C}-field (see 


exercises 2—7 on p. 137). 
The proof of the Tsen-Lang theorem is an inductive one. The proof of the initial step that if 
F is algebraically closed then F(x) is C; and the proof of the inductive step will make use of 


the following result, which is a consequence of Theorem 7.29, p. 453. 


LEMMA 1. Let F be an algebraically closed field, f,, ..., f, polynomials without constant 
terms in n indeterminates with coefficients in F If n > r, then the system of equations f(x), 


e Xn) = 0, ... fq, -e Xp) = 0 has a non-trivial solution in F” 


We need also an analogous result for systems of polynomial equations in a C,-field, which 
we derive below. First, we give the following 


DEFINITION 11.6. 4 polynomial with coefficients in F is called normic of order i ( > 0) 


for F if it is homogeneous of degree d> 1 in di indeterminates and has only the trivial zero 
in F. 


The argument used above shows that if F is not algebraically closed, then there exist normic 
polynomials of order 1 for F. If F is arbitrary and ¢ is an indeterminate, then it is readily seen 


that F(t) is not algebraically closed (for example, V'#F), Hence there exist normic 
polynomials of order 1 for F(t). We have also 


LEMMA 2. If there exists a normic polynomial of order i for F, then there exists a normic 
polynomial of order i + 1 for F(t), t an indeterminate. 


Proof. Let N(x), ..., xg) be a normic polynomial of order i for F. The degree of N is d. We 
claim that 


(14) N(xq,.... Xa) +N (Kare tyes X20)t + tN (Xe yat ty Xati! 


is a normic polynomial of order i + 1 for F(t). Since this polynomial is homo geneous of 
degree d in d! * ! indeterminates, it suffices to show that it has only the trivial zero in F(®. 
Hence suppose that (a), ..., ag! * ') is a non-trivial zero of (14). Since the polynomial is 
homogeneous, we may assume that the a; € F [t] and not every a; is divisible by ¢. Let a, be the 
first one that is not, and suppose that jd’ + 1 < k < (j + Id’. Then dividing by # gives the 
relation ‘V (4M te-+-+4u+ae) = “(mod ¢) and hence reducing modulo ¢ gives N (ajeti yrn = ¥ 
where a is the constant term of a. Since a, # 0, this contradicts the hypothesis that V is normic 


and completes the proof. D 


If N(x, ..., Xg) is a normic polynomial of order i and degree d, then 


(15) N(N(X1,. ' Aih N(Xa+ EE -s X24"), aN Oii Ei nny X42) 


is homogeneous of degree d? in d” indeterminates. Moreover, this has only the trivial zero in 
F. Hence (15) is normic. Since d > 1, we can apply this process to obtain from a given normic 
polynomial normic polynomials of arbitrarily high degree. We shall use this remark in the 
proof of an analogue of Lemma | for C;-fields: 


LEMMA 3 (Artin). Let F be a C,-field for which there exists a normic polynomial of order 
i(i>0). Let fi, ..., f, be homogeneous polynomials of degree d in the indeterminates x), ..., 


x, with coefficients in F If n > rd, then the f5 have a common non-trivial zero in F. 


Proof. Let N be a normic polynomial of order i for F. Suppose the degree of N is e. Then the 
number of indeterminates in N is e’, which we can write as et = rs + t where s > 0 and 0 <t< 
r. We replace the first r indeterminates in N by fi; ..., Xn) <- JAX] -< X,) respectively, the 


next r by f&n 4 15 +++) Xn) +o S (Xp + 1 +++ X2n), etc., and the last t indeterminates in N by 0’s. 
This gives the polynomial 


M = N(fi(X1, ae | A s.e Sel, a Xn), 


Hs lo: ES E T dot AT A AA UT A 


which is homogeneous in ns indeterminates. Moreover, deg M = ed. We want to have ns > (ed)! 
= d'(rs + t). This will be the case if (n — d'r)s > d't. Since n > dir and 0 < t < r, this can be 
arranged by choosing e large enough— which can be done by the remark preceding the lemma. 
With our choice of e we can conclude from the C;-property that M has a non-trivial zero in F. 


Since N is normic, it follows easily that the f; have a common non-trivial zero in F. O 


In the next lemma, for the sake of uniformity of statement, we adopt the convention that 1 is 
a normic polynomial of order 0. Then we have 


LEMMA 4. Let F be a C;field, i > 0, such that there exists a normic polynomial of order i 
for F Then F(t), t indeterminate, is a C; ,-feld. 


Proof. Let f(x), ..., X,.) be a homogeneous polynomial of degree d with coefficients in F(t) such 


that n > d'* 1. We have to show that f has a non-trivial zero in F(t). There is no loss in 
generality in assuming that the coefficients of fare polynomials in t. Let r be the degree of f in 
t. Put 


Xj=XjotXplto + Xp’, | Sj Sn, 
where the x;, are indeterminates. Then 
. A A _ _— n sd + 
F(X1,---5Xn) = fo+ fit + + fatt d 


where the f; are homogeneous polynomials of degree d in the n(s + 1) x;, with coefficients in F. 


The polynomial f will have a non-trivial zero in F(t) if the f; have a common non-trivial zero in 


F. By Lemmas 1 and 3 this will be the case if n(s + 1) > (sd + r + ld! Since n > d! * !, the 
inequality can be satisfied by taking s sufficiently large. D 


The proof of the Tsen-Lang theorem is now clear. First, Lemma 4 with i = 0 shows that if F 
is algebraically closed, then F(x) is a C,-field. Next, iterated application of Lemma 2 shows 


that there exists a normic polynomial of order i for F(x), ..., x;). Then iterated application of 
Lemma 4 implies that F(x), ..., x,) is a C,-field. OD 


We can apply this result for the case d = 2 to conclude that if F is an algebraically closed 
field, then any quadratic form on a vector space V of 2” * ! dimensions over F(x), ..., X,) 


represents 0 non-trivially. It follows that any quadratic form on a vector space of 2” 
dimensions over F(x), ..., X„) is universal. We shall make use of this result in the proof of 


THEOREM 11.12. Let R be a real closed field and let Ọ be a Pfister form on a 2"- 
dimensional vector space over the field R(x), ..., X,). Then Q represents every non-zero sum 


of two squares in R(x, ..., Xp). 


Proof. Let Q be a Pfister form on a 2”-dimensional vector space V over R(x), ..., x,,.). We have 
to show that if b = b? + b? + 0, b; e R(x, ..., x,), then b is represented by Q. Since Q 
represents 1, the result is clear if b}b, = 0. Hence we assume b,b, + 0. Let C = R(i), i? 

and consider the extension field C(x), ..., x,) of R(x), ..., x,) and the vector space 
P= Vote cap SER XI mia M enea) ig a base for C/R, then this is a base for C(x), ..., Xp) 
over -i ..., X,). Moreover, every element of P can be written in one and only one way as 


eiu, +ezuz, MEV (identified with a subspace of P in the usual way). The quadratic form Q has a 
unique extension to a quadratic form @ on P. Evidently @ is a Pfister form. Now put q = b; + 


bi. Then (1 ,q) is a base for C/R and q —2biq+b = q—2bia+(b:?+bs*)=0. There exists a vector ü = 
u, + quo, u; E V, such that O (ï) = q. Then Olu) + 2gO(uy, uy) + g?O(uy) = q. Since (1 ,g) is a 
base for C(x}, ..., X RŒ, «.., x,) and q? — 2b,q + b = 0, this implies that O(u,) = bO(uy). It 
follows that b is represented by Q. O 


If we combine Artin’s theorem (p. 640) with Theorems 11.11 and 11.12, we obtain the main 
result: 


THEOREM 11.13. Let R be a real closed field. Then any positive semi- definite rational 
function of n variables over R is a sum of 2” squares. 


More generally, the same theorems show that any positive semi-definite rational function of 
n variables in R can be represented by a Pfister form of dimension 2”. It is noteworthy that to 


prove Theorem 11.13 we had to use Pfister forms other than the sum of squares. 


EXERCISES 


1. (J. W. S. Cassels.) Let F be a field of characteristic # 2, x an indeterminate. Let p(x) € 
F[x] be a sum ofn squares in F(x). Show that p(x) is a sum ofn squares in F[x]. 


(Outline of proof The result is clear if p = 0. Also if- 1 = }”, a; where the a; € F, 


then 
_ (vt! & (alp-)Y 
(F) +E ( 2 ) 


Hence we may assume that p # 0 and that Q = }¥”} x7 is anisotropic in F and hence in 
F(x). We have polynomials A&A), ...,./,(x) such that f(x) # 0 and 


(16) p(x) folx)? = filx)? + 0° + fal). 


Let fo(x) be of minimum degree for such polynomials and assume that deg fo(x) > 0 
write. 


(17) fix) = fo(x)gi(x)+ni(x), 0 <i <n, 


where deg r(x) < deg f(x) (we take deg 0 = — œ). Let Q' be the quadratic form such that 
O'~ diag { —p, 1,..., 1} with n 1’s. Then (16) gives O(f) = 0 where f= (fo... fn). The 
result holds if Og) = 0 for g = (gp = 1,..., g,,), so assume that Q’ (g) # 0. This implies 


that f and g are linearly independent over F(x). Then h = O'(g,g)f — 20(f,.g)g + 0 and Q' 
(h) = 0. If hg = 0, then Q represents 0 nontrivially in F(x), hence in F contrary to the 


hypothesis. Hence hy # 0. The h; are polynomials in x and 


/ a . l n . a l a 2 
ho = Q'(9, 9) fo-20(f, 9) = = O'(fog-f) = = X (ri(x))*. 
fo Joi 


Then deg hy < deg fy , which is a contradiction.) 


2. (Cassels.) Let F, x be as in exercise 1 and let x? + d, for d € F, be a sum ofn > 1 squares 
in F(x). Show that either — 1 is a sum of n — 1 squares in F or d is a sum of n squares in 
F. 


3. (Cassels.) Use exercises 1 and 2 to show that if R is real closed, then x,7 + ... + x,,7 is 
not a sum ofn — 1 squares in R(x), ..., X,). 


4. (T. Motzkin.) Let R be real closed, x and y indeterminates, and let 


p(x, y) = 1+x?(x?—3)y? +x? yt. 


Verify that 


> 


(1—x?y?)? +.x2(1—y?)? +.x2(1— x? y? 
1+x? 


p(x, y) = 


(1—x?— 2x? y?)? +(x(1—x?)y)? F S = T 


7 (1+x?) 


so p(x, y) is a sum of four squares in R(x)[y]. Show that p(x, y) is not a sum of squares in 
R[x, y]. 
5. Show that any algebraic extension ofa C;-field is a C;-field. 


11.7 ARTIN-SCHREIER CHARACTERIZATION OF REAL CLOSED 
FIELDS 


We conclude our discussion of real closed fields by establishing a beautiful characterization of 
these fields that is due to Artin and Schreier. We have seen that a field R is real closed if and 
only if V-! É R and C = R(v —!) is algebraically closed (Theorem 11.3, p. 634). Artin and 
Schreier prove the following considerably stronger theorem: 


THEOREM 11.14. Let C be an algebraically closed field, R a proper subfield of finite 
codimension in C ([C: R] < œ). Then R is real closed and C= R(¥ ~'). 


We prove first some elementary lemmas on fields of characteristic p # 0. The first two of 
these were given in BAI, but for convenience we record again the statements and proofs. 


LEMMA 1. Let F be a field of characteristic p, a an element of F that is not a pth power. 
Then for any e > 1, the polynomial x?° — a is irreducible in F{x]. 


Proof. If E is a splitting field of x?° — a, then we have the factorization x°° — a = (x — r)P in 
E{x]. Hence if g(x) is a monic factor of x?° — a in F[x], then g(x) = (x — r)*, k = deg g(x). Then 
r% e F and P? =a e F. If p! = (p°, k) there exist integers m and n such that p = mp® + nk. Then 
Pf = =" PPV EF If k< p°, then f< e and if b = m, then b?* ‘= a, contrary to the hypothesis 
that a is not a pth power in F. O 

LEMMA 2. Jf F is a field of characteristic p and a E F is not of the form uP —u,ueé F, 
then xP —x —a is irreducible in F{x]. 


Proof. Ifr is a root of x” —x—ain E[x], E a splitting field, thenr+1,7r+2,...,r+(p-1) 


are also roots of x? — x — a. Hence we have the factorization 


jaj 
x?—x—a= || (x-r-i) 
i=0 


in Efx]. If g(x) = x* — bxf =! + ... is a factor of x? — x — a, then kr + /1 = b where / is an 


integer. Hence k < p implies that r e F. Since 7? — r =a, this contradicts the hypothesis. O 


LEMMA 3. Let F and a be as in Lemma 2 and let E be a splitting field for x? — x — a. Then 
there exists an extension field K/E such that [K:E] = p. (Compare Theorem 8.32, p. 510.) 


Proof. We have E = F(r) where P =r + a. We claim that the element ar? ~ ! € E is not of the 
form u? — u, u € E. For, we can write any u as ug + uyr +... + u, _ l u; € F, and the 


condition u? — u = ar? ~! and the relation” = r + agive 
gl 


Uo? +uP(r+a)+ u(r +a} + i; ir +a)! 


— Ug — ut — t — Up rP* = ar?" 
Since (1, r, ..., 7? ~ ') is a base, this gives the relation uP, 1 — Uy _ 1 =a, contrary to the 


hypothesis on a. It now follows from Lemma 2 that x? — x — ar? ~ ! is irreducible in E[x]. 


Hence if K is the splitting field over E of this polynomial, then [K:E] =p O 


We can now give the 


Proof of Theorem 11.14. Let C' = R(w ~!) c C. We shall show that C' = C. Then the result will 
follow from Theorem 11.3. Now C is the algebraic closure of C; hence any algebraic extension 
of C' is isomorphic to a subfield of C/C’ and so its dimensionality is bounded by [C:C’]. The 
first conclusion we can draw from this is that C’ is perfect. Otherwise, the characteristic is a 
prime p and C' contains an element that is not a pth power in C’. Then by Lemma 1, there exists 
an algebraic extension of C' that is p°-dimensional for any e > 1. Since this has been ruled out, 
we see that C' is perfect. Then C is separable algebraic over C’ and since C is algebraically 
closed, C is finite dimensional Galois over C’. Let G = Gal C/C’, so |G| = [C: C’ ]. 


Now suppose that C # C’. Then |G| + 1. Let p be a prime divisor of |G|. Then G contains a 
cyclic subgroup H of order p. If E is the subfield of H-fixed elements, then C is p-dimensional 
cyclic over E. If p were the characteristic, then C = E(r) where the minimum polynomial of r 
over E has the form x? — x — a(BAI, p. 308). Then by Lemma 3, there exists a p-dimensional 
extension field of C. This contradicts the fact that C is algebraically closed. Hence the 
characteristic is not p, and since C is algebraically closed, C contains p distinct pth roots of 
unity. These are roots of x? — 1 = (x —l)(x? 7! + xP ~ 2+... + 1) and since the irreducible 
polynomials in E[x] have degree dividing [C: E] = p, it follows that the irreducible factors of 
xP — l in E[x] are linear and hence the p pth roots of unity are contained in E. Then C = E(r) 


where the minimum polynomial of r over E is x” — a, a € E (BAI, p. 308). Now consider the 
polynomial x”? — a. This factors as [l= where u is a primitive p?-root of unity and s = 
a. If any u's € E, then (u's)? € E and ((u's)?P = a, contrary to the irreducibility of x? — a in 
E[x]. It follows that the irreducible factors of x?? — a in Efx] are of degree p. If b is the 
constant term of one of these, then b = s?v where v is a power of u. Since (s? ? = a, sP € E and 
since [C: E] = p, C = E(s?) = E(bs — p) = E(v). Since E contains all the pth roots of unity, it 
follows that v is a primitive p-root of unity. 

Let P be the prime field of C and consider the subfield P(v) of C. If P=@, we know that the 
cyclotomic field of p’th roots of unity has dimensionality g(p") over @ (BAI, p. 272). This 
number goes to infinity with r. If P is of finite characteristic /, then we have seen that / + p. 


Then the field of p’th roots of unity over P contains at least p” elements, so again its 
dimensionality over P tends to infinity with r. Thus in both cases it follows that there exists an 
r such that P(v) contains a primitive p’th root of unity but no primitive p™*! st root of unity. 
Since v is a primitive p?-root of unity, r > 2. The field C contains an element w that is a 
primitive p’*! st root of unity. We now consider the cyclotomic field P(w). Let K = Gal P(w)/P. 
If P is finite, then we know that K is cyclic (BAI, p. 291). The same thing holds if P = @ unless 
p=2 andr > 2. If K is cyclic, then it has only one subgroup of order p and hence, by the Galois 
correspondence, P(w) contains only one subfield over which it is p-dimensional. We shall now 
show that P(w) has two such subfields. This will imply that p = 2 and the characteristic is 0. 

Let h(x) be the minimum polynomial of w over E. Since v € E, w € E and C = E(w). Hence 
deg h(x) = p. Moreover, A(x) is a divisor of P= Ne e- so the coefficients of h(x) 
are contained in the subfield D = E N P(w). It follows that [P(w):D] = p. Next consider the 
subfield D’ = P(z) where z = w”. The element w is a root of x? — z € D'[x] and this polynomial 
is either irreducible or it has a root in D' (BAI, exercise 1 on p. 248). In the first case [P(w):D 
"| = p. In the second case, since z is a primitive p’th root of unity, D' contains p distinct pth 
roots of unity and hence x” — z is a product of linear factors in D’[x]. Then w € D' and D' = 
P(w). But P(v) contains a primitive p’th root of unity and hence P(v) contains z, so if D' = 
P(w), then P(v) will contain a primitive p” * 'st root of unity w, contrary to the choice of r. 
Thus [P(w): D'] = p. Now D' + D. Otherwise, D contains a primitive p’th root of unity and E 
contains a primitive p’th root of unity. Then E contains w, contrary to the fact that [C:E] = 
[E(v):E] = p. Thus D and D' are distinct subfields of P(w) of codimension p in P(w). As we 
saw, this implies that the characteristic is 0 and p = 2. Now C= E(v) and v is a primitive 27 = 
Ath root of unity. Hence v = + ¥ ~!. Since E contains Y~', we contradict [C: E] = 2. This 
completes the proof. O 
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